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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME; :r-i:csr.ecics sa . 

(B) STREET: Indus tnep ark Zwijcaarde 7, box 4 

(C) CITY: Ghent 

(S) COUNTRY: Belgium 

(F) POSTAL CUE (21?) : 3-9052 

(G) TELEPHONE: 00 32 9 241 07 II 

(H) TELEFAX: 00 32 9 241 07 99 

(ii) TITLE Or INVENTION: New sequences of hepatitis C virus genotypes 
for diagnosis, prophylaxis and therapy. 

(iii) NUMBER Or SEQUENCES: 270 

(iv) COMPUTER - READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
{3} COMPUTER: I3M PC compatible 

(C) OPERATING SYSTEM: PC -DOS/MS -DCS 

(D) SOFTWARE: Patently Release #1.0, Version £1.25 (EPC) 



(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(3) CLONE: 3R34-4-20 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . -213 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CTC ACG GAA COG CTT TAC TGC GGG GGC CCT ATG TTC AAC AGO AAG GGG 
Leu Thr Glu Arg Leu Tyr Cys Giy-Gly Pro Met Phe Asn Ser Lys Gly 
15 10 15 

GCC CAG TGT GGT TAT CGC CGC TGC CGT GCC AGT GGA GTT CTG CCT ACC 
Ala Gin Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr 
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20 25 ' 30 



AGC TTC GGC AAC ACA ATC ACT TGC TAC ATC AAG GCC ACA GCG GCT GCA 144 

Ser Phe Gly Asn Thr He Thr Cys Tyr He Lys Ala Thr Ala Ala Ala 
35 40 4S 

AGG GCC GCA GGC CTC CGG AAC CCG GAC TTT CTT GTC TGC GGA GAT GAT 192 

Arc Ala Ala Gly Leu Arg Asn Pro Asp Phe Leu Val Cvs Glv Asc Asc 
50 55 go 

CTG GTC GTG GTG GCT GAG AGT 213 

Leu Val Val Val Ala Giu Ser 
65 70 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 71 amino acids 

(B) TYPE: amino acid 
^ (D) TOPOLOGY: linear 

J? (ii) MOLECULE TYPE; protein 

S ^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 • 

flf Leu Thr Glu Arg Leu Tyr Cys Gly Glv Pro Msi Phe Asr. Ser Ivs Glv 

ru 1 s " ' ' * io is 

K " Ala Gln Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr 

20 25 30 

tassF 

^ Ser ?he G1 Y A sn Thr He Thr Cys Tyr He Lys Ala Thr Ala Ala Ala 

TZ 35 40 45 



Arg Ala Ala Gly Leu Arg Asn Pro Asp Phe Leu Val Cys Gly Asp Asp 
50 55 50 

Leu Val Val Val Ala Glu Ser 
65 70 

(2) INFORMATION FOR SEQ ID NO : 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 



(vii) IMMEDIATE SOURCE: 

(3) CLONE: BR36-23-19 

Cix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..213 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3: 

CTC ACG GAA CGG CTT TAC TGC GGG GGC CCT ATG TTC AAC AGC AAG GGG 
Leu Thr Glu Arg Leu Tyr Cys Gly Gly ?ro Met ?he Asn Ser Lys Gly 
15 10 15 

GCC CAG TGT GGT TAT CGC CGC TGC CGT GCC AGT GGA GTT CTG CCT ACC 
Ala Gin Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr 
20 25 30 



AGG GCC GCA GGC CTC CGG AAC CCG GAC TTT CTT GTC TGC GGA GAT GAT 
Arg Ala Ala Gly Leu Arg Asn Pro As? Phe Leu Val Cvs Gly Asn Asv 
50 55 60 * ' ' 

CTG GTC GTG GTG GCT GAG AGT 
Leu Val Val Val Ala Glu Ser 
65 70 



(2) INFORMATION FOR SEQ ID NO : 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 71 arr--o acids 
(3 ) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : prccain 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Leu Thr Glu Arg Leu Tyr Cys Giy Gly Pro Men Phe Asn Ser Lvs Glv 

1 5 10 i 5 

Ala Gin Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr 
20 25 ' 3Q 

Ser Phe Gly Asn Thr He Thr Cys Tyr He Lys Ala Thr Ala Ala Ala 
35 40 45 

Arg Ala Ala Gly Leu Arg Asn Pro Asp Phe Leu Val Cvs Gly Aso Aso 
50 55 ' 

Leu Val Val Val Ala Glu Ser 
S5 70 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE3S : single 

(D) TOPOLOGY: linear 



60 



48 



96 



AGC TTC GGC AAC ACA ATC ACT TGC TAC ATC AAG GCC ACA GCG GCT GCA 144 
Ser Phe Gly Asn Thr lie Thr Cys Tyr lie Lys Ala Thr Ala Ala Ala 
35 40 45 



192 



21j 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 



(vii) IMMEDIATE SOURCE : 

(3) CLONE: 3R36-23-13 

(A) NAME / KEY : CDS 

(B) LOCATION: 1. .213 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
CTC ACG GAG CGG CT~ TAC TC,r r>~ - ^. . „™ 

- ~ ~ - £ £ £ £ E Z £ £ £ - 



10 Xi 



GCC CAG TGT GGT TAT CGC CGT TGC CS^ GC~ -c- r~- ^ 

Ala Gin Cys Gly Tyr Ar 3 Ara cys ta" £ ^ .^V ^ ° CT ACC 

"Si T 0 ' Se - ^a. ^eu Pro Thr 

! s ! ~ 25 3 0 

^ AGC TTC AC.-. ATC ACT TGT TAC A- AAA G" ^ 

W Ser Phe Gly As, Thr lie Thr Cys Tvr zll £ ^ ^ ^ ^ G f 

- A - a Axa Ala Ala 

4 s 35 40 45 

Si 

0 GCC GCA GGC CTC CGG AGC CCG GA- T— C~ G"~ t~- r~ ^ 

1 Lys "* -» - •» -p ~ - - £ £ £ £ 



55 60 



65 70 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 71 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : protein 
(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 6: 
Leu Thr GIu Arg Leu Tyr Cys Gly Giv Pro Met Phe Asn Ser Lvs Gly 

Ala Gin Cys Gly Tyr Arg Arg Cys Arg Ala Ser Giv Val Leu P^o T - 

Ser Phe Gly Asn Thr lie Thr Cys Tyr lie Lys Ala Thr Ala Ala Ala 



43 



35 



14- 



192 



OTG GTC GTG GTG GCT GAG AGT 
Leu Val Val Val Ala Glu Ser 213 



40 45 
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Lys Ala Ala Gly Leu Arg Ser Pro Asp Phe Leu Val Cys Gly Asp Asp 
50 55 60 . 

Leu Val Val Val Ala Glu Ser 
65 70 

(2) INFORMATION FOR SEQ ID NO : 7: 

(i) sequence characteristics : 

(A) LENGTH : 213 base pairs 

(3) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(3) CLONE : BR3S-23-2C 

(ix) FEATURE: 

(A) NAME / KEY : CDS 
(3) LOCATION: 1 . . 213 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CTC ACQ GAG CGG CTT TAC TGC GGG GGC OCT ATG TTT AAC AGC AAA GGG 4i 

Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly 

1 5 10 15 

GCC CAG TGT GGT TAT CGC CGT TGC CGT GCC AGT GGA GTT CTG CCT ACC 9' 

Ala Gin Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr 

20 25 30 

AGC TTC GGC AAC ACA ATC ACT TGT TAC ATC AAA GCC ACA GCG GCC GCA 14' 

Ser Phe Gly Asn Thr He Thr Cys Tyr He Lys Ala Thr Ala Ala Ala 

35 40 45 

AAA GCC GCA GGC CTC CGG AGC CCG GAC TTT CTT GTC TGC GGA GAT GAT 19 

Lys Ala Ala Gly Leu Arg Ser Pro Asp Phe Leu Val Cys Gly Asp Asp 

50 55 60 

CTG GTC GTG GTG GCT GAG AGT 21 

Leu Val Val Val Ala Glu Ser 

65 70 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 71 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY : linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 

Leu Thr Glu Arg Leu Tyr Cys Giy Gly Pro Met Phe Asn Ser Lys Gly 
15 xo 15 

Ala Gin Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr 
20 25 3 0 

Ser Phe Gly Asn Thr He Thr Cys Tyr lie Lys Aia Thr Ala Ala Ala 
35 40 45 

Lys Ala Ala Gly Leu Arg Ser Pre As? Phe Leu Val Cys Gly Asi3 Asp 
50 55 60 

Leu Val Val Val Ala Glu Ser 
65 70 

(2) INFORMATION FOR SEQ ID NC : 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 213 base pairs 
(3) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NC 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: BR33-2-17 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .213 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 9: 

CTC ACG GAG CGG CTT TAG TGC GGG GGC OCT ATG TTC AAC AGC AAG GGG 
Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly 
1 5 10 Is" 



43 



GCC CAG TGT GGT TAT CGC CGT ib; CGT GCC AGT GGA GTT CTG CCT ACC 96 

Aia Gin Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr 

20 25 30 

AGT TTC GGC AAC ACA ATC ACT TGT TAC ATC AAG GCC ACA GCG GCT GCA 144 

Ser Phe Gly Asn Thr He Thr Cys Tyr lie Lys Ala Thr Ala Ala Ala 

35 40 45 

AAA GCC GCA GGC CTC CGG AAC CCG GAC TTT CTT GTT TGC GGA GAT GAT 192 
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Lys Ala Ala Gly Leu Arg Asn Pro Asp Phe Leu Val Cys Gly Asp Asp 
50 55 60 

TTG GTC GTG GTG GCT GAG AGT 
Leu Val Val Val Ala Glu Ser 
65 70 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amine acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SZQUZXCZ DESCRIPTION: SEQ ID NO: 10: 

Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly 
1 5 10 15 

Ala Gin Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr 
20 25 30 

Ser Phe Glv Asn Thr He Thr Cys Tyr He Lys Ala Thr Ala Ala Ala 
35 40 45 

Lys Ala Ala Gly Leu Arg Asn Pro Asp Phe Leu Val Cys Gly Asp Asp 
50 * 55 60 

Leu Val Val Val Ala Glu Ser 
65 70 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(3) CLONE: BR33-2-21 

(ix) FEATURE: 

(A) NAME / KEY : CDS 
(3) LOCATION: 1 . - 213 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
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CTC ACG 
Leu Thr 
1 

GCC CAG 
Ala Gin 

AGT TTC 
Ser Phe 



AAA GCC 
Lys Ala 
50 



GAG CGG 
Glu Arg 



TGT GGT 
Cys Gly 
20 

GGC AAC 
Gly Asn 
35 

GCA GGC 
Ala Glv 



CTT TAC 
Leu Tyr 
5 

TAT CGC 
Tyr Arg 



TGC 
Cys 



CGT 
Arci 



ACA ATC AC 



CTC CGG 
Leu Arc: 



AAC 
Asn 
5 5 



GGG GGC CCT ATG TTC AAC AGC AAG GGG 
Gly Gly Pro Met Phe Asn Ser Lys Gly 
10 1S 

TGT CGT GCC AGT GGA GTT CTG CCT ACC 
Cys Arg Ala Ser Gly Val Leu Pro Thr 

TGT TAC ATC AAG GCC ACA GCG GCT GCA 
Cys Tyr lie Lys Ala Thr Ala Ala Ala 
40 4S 

CCG GAC TTT CTT GTT TGC GGA GAT GAT 
Pro Asp Phe Leu Val Cys Gly Asp Asp 

eo 



48 



95 



144 



192 



TTG GTC 
Leu Val 
65 



GTG GTG 
Val Val 



GCT GAG 
Ala Glu 
70 



AGT 
Ser 



213 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE C-IAP.ACTERISTICS : 
(A) LENGTH : 71 amine acids 
(3) TYPE: amine acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lvs Glv 
1 5 10 Is 

Ala Gin Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr 
20 25 30 

Ser Phe Gly Asn Thr He Thr Cys Tyr lie Lys Ala Thr Ala Ala Ala 
35 40 45 

Lys Ala Ala Gly Leu Arg Asn Pro Aso Phe Leu Val Cys GW Asp As- 
50 55 60 * " " 

Leu Val Val Val Ala Glu Ser 
65 70 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 1 base pairs 
(3) TYPE : nucleic acid 

(C) STHANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
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(iii) ANTI-SENSE: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: HD10-2-3 

fix] FEATURE: 

(A) NAME/KEY: CDS 
(3) LOCATION: 2.. 541 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

C GTC GGC GCT CCT GTA GGA GGC GTC GCA AGA GCC CTT GCG CAT GGC 4 6 

Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly 
1 ' 5 10 15 

GTG AGG GCC CTT GAA GAC GGG ATA AAT TTC GCA ACA GGG AAT TTG ZZZ ' 94 

Val Arg Ala Leu Glu Asp Gly lie Asn ?he Ala Thr Gly Asn Leu Pro 
20 25 30 

P 

^ GGT TGC TCC TTT TCT ATC TTC CTT CTT GCT CTG TTC TCT TGC TTA ATC 14 2 

f& Gly Cys Ser ?he Ser He Phe Leu Leu Ala Leu Phe 3er Cys Leu lie 

CI 3 5 40 45 

i -. I 

CAT CCA GCA GCT AGT CTA GAG TGG CGG AAC ACG TCT GGC CTC TAT GTC 190 
1st KiS Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser Gly Leu Tyr Val 

Iy 50 =5 50 

-Am 

* CTT ACC AAC GAC TGT TCC AAT AGC AGT ATT GTG TAT GAG GCC GAT GAC 23 3 

S Leu Thr Asn Asp Cys Ser Asn Ser Ser lie Val Tyr Glu Ala Asp Asp 
01 65 " ' 70 75 

Q 

fjl GTT ATT CTG CAC ACA CCC GGC TGT GTA CCT TGT GTT CAG GAC GGT AAT 29 6 

Q Val He Leu His Thr Pro Giy Cys Val Pro Cys Val Gin Asp Gly Asn 

|!l 80 85 90 95 

ACA TCT GCG TGC TGG ACC CCA GTG ACA CCT ACA GTG GCA GTC AGG TAG 334 
Thr Ser Ala Cys Trp Thr Pro Val Thr Pro Thr Val Ala Val Arg Tyr 
100 105 110 

GTC GGA GCA ACC ACC GCT TCG ATA CGC AGG CAT GTA GAC ATG TTG GTG 382 
Val Gly Ala Thr Thr Ala Ser He Arc Arg His Val Asp Met Leu Val 
115 120 125 

GGC GCG GCC ACG ATG TGC TCT GCT CTC TAC GTG GGT GAT ATG TGT GGG 430 
Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly As? Met Cys. Giy 
130 135 140 



GCC GTC TTC CTC GTG GGA CAA GCC TTC ACG TTC AGA CCT CGT CGC CAT 

Ala Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg Pro Arg Arg His 

145 150 155 

CAA ACG GTC CAG ACC TGT AAC TGC TCA CTG TAC CCA GGC CAT CTT TCA 

Gin Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser 

16 0 165 170 175 



478 



526 
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GGA CAC CGA ATG GCT 54 
Gly His Arg Met Ala 
180 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS : 
{A) LENGTH: 18 0 amino acids 
(3) TYPE: ammo acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Val Gly Ala Pro Val Gly Gly Vai Ala .Arg Ala Leu Ala His Gly Vai 
15 10 IS 



Arg Ala Leu Glu As? Gly lie Asn Phe Ala Thr Gly Asn Leu Pro Giv 

20 25 30 



Cys Ser Phe Ser lie Phe Leu leu Ala Leu Phe Ser Cvs Leu II' 
35 40 45 



Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser 
50 55 



;u Tvr Val Leu 



6 0 



Thr Asn As? Cys Ser Asn Ser Ser He Vai Tyr Glu Ala Asp Asp Vai 

65 70 75 80 

lie Leu His Thr Pro Gly Cys Val Pro Cys Vai Gin Asp Gly Asn Thr 

85 90 55 



CP 



Ser Ala Cys Trp Thr Pro Val Thr Pro Thr Vai Ala Vai 



100 



105 



110 



Gly Ala Thr Thr Ala Ser He Arg Arc His Vai Asp Met Leu Vai Gly 
115 120 " 125 



Ala Ala Thr Met Cys Ser Ala Leu Tyr Vai Gly Aso Met Cys Gly Ala 
130 135 ' 140 



Vai Phe Leu Vai Gly Gin Ala Phe Thr Phe Arg Pro Arg Arg His Gin 

145 150 155 ISO 

Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser Gly 

165 170 175 



His Arg Met Ala 
130 



(2) INFORMATION FOR SEQ ID NC : 15: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNES S : single 
<DJ TOPOLOGY: linear 
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(ii) MOLECULE TYPE: CBNA 
(ill) HYPOTHETICAL: NO 
(ill) ANTI- SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(3) CLONE : HD10-2-14 



(A) NAME / KEY : CDS 

(B) LOCATION: 2.. 541 



(xi) SEQUENCE DESCRIPTION: SEC ID NO: 15: 

C GT r GGC GCT CCT GTA GGA GGC GTC GCA AGA GCC CTT GCG CAT GGC 
Vai Glv Ala Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Giv 
i * 5 10 15 

J GTG AGG GCC CTT GAA GAC GGG ATA AAT TTC GCA ACA GGG AAT TTG CCC 

m val Arg Ala Leu Glu Asp Gly He Asr. Phe Ala Thr Glv Asn Leu Pro 

™ , 0 25 30 

W GGT TGC TCC TTT TCT ATC TTC CTT CCT GCT CTG TTC TCT TGC TTA ATC 

G iy Cvs Ser Phe Ser lie Phe Leu Pro Ala Leu Phe Ser Cys Leu xle 
35 «° 45 



CA^ CCA GCA GCT AGT CTA GAG TGG CGG AAC ACG TCT GGC CTC TAx G,C 
H^s Pro Ala Aia Ser Leu Glu Tr? Arg Asn Thr Ser Gly Leu Tyr Va* 

01 50 55 60 

C- ACC AAC GAC TGT TCC AAT AGC AGT ATT GTG TAT GAG GCC GAT GAC 
Leu Thr Asn As? Cys Ser Asn Ser Ser lie Val Tyr Glu Ala As? As? 



65 70 75 

GT- ATT CTG CAC ACA CCC GGC TGT GTA CCT TGT GTT CAG GAC GGT AAT 
Va1 * T^ a Le „ wis Thr Pro Gly Cys Val Pro Cys Val Gin Asp Gly Asn 

80 " 35 90 95 

ACA TCT GCG TGC TGG ACC CCA GTG ACA CCT ACA GTG GCA GTC AGG TAC 

Thr Ser Ala Cys Tro Thr Pro Val Thr Pro Thr Val Ala Val Arg Tyr 

100 1°5 H° 

GTC GGA GCA ACC ACC GCT TCG ATA CGC AGG CAT GTA GAC ATA TTG GTG 
Val Glv Ala Thr Thr Ala Ser lie Arg Arg Bis Val Asp He Leu Val 
115 120 123 

GGC GCG GCC ACA ATG TGC TCT GCT CTC TAC GTG GGT GAT ATC- TGT GGG 
Gly Ala Ala Thr Mer Cys Ser Ala Leu Tyr Val Gly Asp Met Cys Gly 



130 



135 



G^c TTC CTC GTG GGA CAA GCC "TTC ACG TTC AGA CCT CGT CGC CAT 
Ala Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg Pro Arg Arg his 
145 1-0 155 

CAA ACG GTC CAG ACC TGT AAC TGC TCA CTG TAC CCA GGC CAT CTT TCA 



46 



190 



233 



236 



334 



332 



430 



473 



525 
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Gin Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser 
150 "5 170 175 

GGA CAC CGA ATG GCT 
Gly His Arg Mec Ala 

19C 



541 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 
(AJ LZTiGTd: 13 0 amine acids 
(3) TYPE: amino acid 
(D) TOPOLOGY : linear 

(iii MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Va^ 
15 10 15 

Arg Ala Leu Glu Asp Gly lie Asn Phe Ala Thr Giv As- ' e- P-o ^ v 
20 25 * " ^ ~ 



vs Leu lie Kis 



Cys Ser Phe Ser He ?he Leu Pro Ala Leu Phe Ser C 

33 40 45 

Pro Ala Ala Ser Leu Glu Trp Arg Asr. Thr Ser Giv Leu tv- Va 1 L— 
30 55 SO * ~ 

Thr Asr. As? Cys Ser Asr. Ser Ser He Val Tyr Glu Ala Aso Asd Val 
65 70 75 so 

He Leu His Thr Pro Gly Cys Val Pro Cys Val Glr. Ass Giv Asr. Thr 

Ser Ala Cys Trp Thr Pro Val Thr Pro Thr Val Ala Val Ara Tyr Va^ 
100 105 no 

Gly Ala Thr Thr Ala Ser He Arg Arg His Val Asp He Leu Val Gly 
115 120 i2S 

Ala Ala Thr Me; Cys Ser Ala Leu Tyr Val Giv Aso Mec Cvs G^v Ala 
130 135 ' 14 * 0 

Val Phe Leu Val Gly Glr. Ala Phe Thr Phe Arg Pro Arg Ara His Gin 
145 150 155 - , 160 

Thr Val Gin Thr Cys Asr. Cys Ser Leu Tyr Pro Gly His Leu Ser Gly 
1S5 170 173 

His Arg Met Ala 
ISO 

(2) INFORMATION FOP. SZQ ID NO: 17 : 
(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE : 

(3) CLONE: KD10-2-21 

(ix) FEATURE: 

(A) NAME /KEY: CDS 
(3) LOCATION: 2 . . 541 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

^ C GTC GGC GCT CCT GTA GGA GGC GTC GCA AGA GCZ CTT GCG CAT GGC 4 c 

.5 Val GIv Ala Pro Vai Gly Gly Val Ala Arg Ala Leu Ala His Gly 

^ GTG AGG GCC CTT GAA GAC GGG ATA AAT TTC GCA ACA GGG AAT TTG CCC 94 

LH Va i A i a £,eu Glu Asp Gly He Asn ?he Ala Thr Gly As- Leu Pre 

i« ~ 20 25 30 

* GGT TGC TCC TTT TCT ATC TTC CTT CTT GCT CTG TTC TCT TGC TTA ATC 14 2 

C Gly Cys Ser Phe Ser He ?he Leu Leu Ala Leu ?he Ser Cys Leu lis 

01 3 5 4 0 4 5 

u 

IJl CAT CCA GCA GCT ACT CTA GAG TGG CGG AAC ACG TCT GGC CTC TAG GTC 19 0 

Q His Pro Aia Ala Ser Leu Glu Trp Arg Asn Thr Ser Gly Leu Tyr Vai 

|I 5 0 55 6 0 

CTT ACC AAC GAC TGT TCC AAT AGC AGT ATT GTG TAT GAG GCC GAT GAC 23 3 

Leu Thr Asn Aso Cys Ser Asn Ser Ser He Vai Tyr Glu Aia Asp Asp 
65 " 70 75 

GTT ATT CTG CAC ACA CCC GGC TGT GTA CCT TGT GTT CAG GAC GGT AAT 2 36 

Vai He Leu His Thr Pro Gly Cys Val Pro Cys Val Gin Asp Gly Asn 
80 85 SO 95 



ACA TCT GCG TGC TGG ACC CCA GTG ACA CCT ACA GTG GCA GTC AGG TAC 
Thr Ser Aia Cys Trp Thr Pro Val Thr Pro Thr Val Ala Val Arg Tyr 
100 105 HO 

GTC GGA GCA ACC ACC GCT TCG ATA CGC AGG CAT GTA GAC ATA TTG GTG 
Val Gly Ala Thr Thr Ala Ser He Arg Arg His Vai Asp He Leu Val 
115 120 125 

GGC GCG GCC ACG ATG TGC TCT GCT CTC TAC GTG GGT GAT ATG TGT GGG 
Gly Aia Aia Thr Me: Cys Ser Ala Leu Tyr Val Gly As? Me: Cys Gly 
130 * H5 1-40 



334 



382 



430 
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GCC GTC TTC CTC GTG GGA CAA GCC TTC ACG TTC AGA CCT CGT CGC CAT 
Ala Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg Pro Arg Arg His 
145 150 155 



478 



CAA ACG GTC CAG ACC TGT AAC TGC TCA CTG TAC CCA GGC CAT CTT TCA 
Gin Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser 
160 153 170 175 



526 



GGA CAC CGA ATG GCT 
Gly His Arg Men Ala 
180 



541 



(2) INFORMATION FOR SEQ ID NO : 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE : protein 

<xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 13: 

Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val 
15 10 15 

Arg Ala Leu Glu Asp Gly lie Asn Phe Ala Thr Gly Asn Leu Pro Gly 
20 25 30 

Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Phe Ser Cys Leu lie His 
35 40 45 

Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser Gly Leu Tyr Val Leu 
50 55 50 

Thr Asn Asp Cys Ser Asn Ser Ser lie Val Tyr Glu Ala Asp Asp Val 
65 70 75 8 0 

He Leu His Thr Pro Gly Cys Val Pro Cys Val Gin Asp Gly Asn Thr 
85 90 95 

Ser Ala Cys Trp Thr Pro Val Thr Pro Thr Val Ala Val Arg Tyr Val 
100 105 110 

Gly Ala Thr Thr Ala Ser He Arg Arg His Val Asp He Leu Val Gly 
115 120 125 

Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly Asp Met Cys Gly Ala 
130 135 140 

Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg Pro Arg Arg Kis Gin 
145 150 155 160 

Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser Gly 
165 170 175 

His Arg Met Ala 
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180 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 
(Ci STP-ANCECNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL : NO 

(iii) ANTI-SENSE: NO 

(vii) IMMEDIATE SOURCE: 

(3) CLONE: BR36-9-I3 

(ix) FEATURE: 

(A) NAME / KEY : CDS 
(3) LOCATION : 2 . . 541 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 19: 

C GTC GGC GCT CCC GTA GGA GGC GTC GCA AGA GCC CTT GCG CAT GGC 4 5 

Val Gly Ala Pro Val Giy Gly Val Ala Arg Ala Leu Ala Kis Giy 
1 5 10 15 

GTG AGG GCC CTT GAA GAC GGG ATA AAT TTC GCA ACA GGG A^T TTG CCC 94 
Val Arg Ala Leu Glu Asp Giy He Ash Phe Ala Thr Gly Asa Leu Pre 
20 25 30 

GGT TGC TCC TTT TCT ATT TTC CTT CTT GCT CTG TTC TCT TGC TTA ATT 14 2 

Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Phe Ser Cys Leu He 
35 40 45 

CAT CCA GCA GCT AGT CTA GAG TGG CGG AAT ACG TCT GGC CTC TAT GTC 190 
His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser Gly Leu Tyr Val 
50 55 60 

CTT ACC AAC GAC TGT TCC AAT AGC AGT ATT GTG TAC GAG GCC GAT GAC 23 8 

Leu Thr Asn Asp Cys Ser Asr. Ser Ser He Val Tyr Glu Ala Asp Asp 
65 70 75 

GTT ATT CTG CAC ACA CCC GGC TGC ATA CCT TGT GTC CAG GAC GGC AAT 23 5 

Val lie Leu His Thr Pro Gly Cys He Pro Cys Val Gin Asp Gly Asn 
80 85 90 95 

ACA TCC ACG TGC TGG ACC CCA GTG ACA CCT ACA GTG GCA GTC AAG TAC 3 34 

Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val Ala Val Lys Tyr 
100 105 110 

GTC GGA GCA ACC ACC GCT TCG ATA CGC AGT CAT GTG GAC CTA TTA GTG 33 2 

Val Gly Ala Thr Thr Ala Ser lie Arg Ser His Val Asp Leu Leu Val 
115 120 125 
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GGC GCG GCC ACG ATG TGC TCA GCG CTC TAG GTG GGT GAT ATG TGT GGG 

Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Glv Asp Met Cys Gly 
130 135 - 14j) 

GCC GTC TTC CTT GTG GGA CAA GCC TTC ACG TTC AGA CCT CGT CGC CAT 

Ala Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg o ro ^ g ^ v < s 
145 150 155 



CAA ACG GTC CAG ACC TGT AAC TGC TCG CTG TAC CCA GGC CAT t f CA 

Gin Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser 

175 



160 1^5 170 



GGA CAT CGA ATG GCT 
Gly His Arg Met: Ala 
130 



430 



473 



52S 



541 



(2) INFORMATION FOR SEQ ID NC ; 20: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 18C amino acids 
(3) TYPE: am^nc acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : crotein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

/ai 



Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Glv Val 



1 5 



10 i 



Arg Ala Leu Glu As? Gly He Asn Phe Ala Thr Glv Asn L*u P— G"v 
20 2 5 ' 3Q 

Cys Ser Phe Ser He Phe Leu Leu Ala Leu Phe Ser Cvs Leu lie His 

35 40 45 

Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser Glv Leu Tvr Va" Leu 
50 53 g~ 



a Asp Asp Val 



Thr Asn Asp Cys Ser Asn Ser Ser lie Val Tyr C u A 1 

65 70 75 " ~ 80 

He Leu His Thr Pro Gly Cys He Pro Cys Val Gin Asp Gly Asn Thr 

85 90 93 

Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val Ala Val Lys Tyr Val 

100 105 iIq 



Gly Ala Thr Thr Ala Ser He Arg Ser His Val 



115 i 



Asp Leu Leu Val Gly 

125 



Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly Aso Met: Cvs Glv Ala 

"0 135 14 c 

Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg Pro Arg Arg His Gin 

145 150 155 1S0 

Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro Glv His Leu Ser Glv 
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165 170 175 

His Arg Met Ala 
180 

(2) INFORMATION FOR SEQ ID NO : 21: 

(A) LENGTH: 541 base pairs 

(3) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3R35-9-20 

(ix) FEATURE : 

(A) NAME / KEY : CDS 

(B) LOCATION : 2.. 541 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NC : 21: 

C GTC GGC GCT CCC GTA GGA GGC GTC GCA AGA GCC CTT GCG CAT GGC 4 6 

Val Gly Ala Pro Val Gly Giy Vai Ala Arg Ala Leu Ala His Giy 
15 10 15 

GTG AGG GCC CTT GAA GAC GGG ATA AAT TTC GCA AC A GGG AAT TTG CCC 94 
Val Arg Ala Leu Glu Asp Gly lie Asn Phe Ala Thr Gly Asn Leu Pro 
20 25 30 

GGT TGC TCC TTT TCT ATT TTC CTT CTT GCT CTG TTC TCT TGC TTA ATT 14 2 

Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Phe Ser Cys Leu lie 
35 40 45 

CAT CCA GCA GCT AGT CTA GAG TGG CGG AAT ACG TCT GGC CTC TAT GTC 190 
His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser Giy Leu Tyr Val 
50 55 60 

CTT ACC AAC GAC TGT TCC AAT AGC AGT ATT GTG TAC GAG GCC GAT GAC 
Leu Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala Asp As? 
65 " 70 75 

GTT ATT CTG CAC ACA CCC GGC TGC ATA CCT TGT GTC CAG GAC GGC AAT 
Val lie Leu His Thr Pro Gly Cys lie Pro Cys Val Gin As? Gly Asn 
80 85 90 95 

ACA TCC ACG TGC TGG ACC CCA GTG ACA CCT ACA GTG GCA GTC AAG TAC 3 34 

Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val Ala Val Lys Tyr 
100 105 HO 



238 



286 
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Val aS ^ if A ' A CGC AGT « GTG GAC CTA TTA GTG 

Val Gly Ala Thr Thr Ala Ser lie Arg Ser His Val Aso Leu Leu Val 

115 120 " 12S 

gTv IT' J? f G CTC TAC GTG 007 GA C ATG TGT GGG 

Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly Asp Met Cys Gly 

* 35 140 

a" ^ -I ™ ^ GCC ™ C ACG ™ CCT CGT CGC CAT 

5 " eU ^ GIy G " Ala 5he Th ^ Arg Pro Arg Arg His 

145 130 155 

^ ^ CTG TAC CCA GGC ^ TCA 

G.a Hr Vai Gin Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser 



"5 170 17: 



160 

GGA CAT CGA ATG GCT 
Gly His Arg Met: Ala 
130 



(2) INFORMATION FOR SEQ 10 NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: ISO amir. a acids 
(3) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
Val Gly Ala Pro Val Gly Gly Val Al 



a Arg Ala Leu Ala His Glv Val 
5 10 i = 



Arg Ala Leu Glu Aso Gl 



y lie Asn Phe Ala Thr Gly Asn Leu Pro Gly 

30 



20 25 



Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Phe Ser Cys Leu lie His 
35 40 



45 



Pro Ala Ala Ser Leu Glu T 

50 55 



rp Arg Asn Thr Ser Gly Leu Tyr Val Leu 

60 



Thr Asn Asp Cys Ser Asn Ser Ser He Val 



65 7 0 



Tyr Glu Ala Asp Asp Val 
75 80 



He Leu His Thr Pro Gly Cys He Pro Cys Val Gin Asp Gly Asn thr 

85 90 95 

Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val Ala Val Lvs Tvr Vai 
100 105 ii Q 

Giy Ala Thr Thr Ala Ser He Arg Ser His Val Asp Leu Leu Val Gly 
115 120 " i23 



Ala Ala Thr Met Cys Ser Ala Leu Tyr 



130 135 14(J 



Val Gly Asp Met Cys Gly Ala 



382 



430 



478 



525 



541 
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Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg Pro Arg Arg His Gin 
145 150 155 ISO 

Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser Gly 



155 



170 



175 



His Arg Mec Ala 



(2) INFORMATION rCR SEQ ID NO: 23: 

(1) SEQUENCE CZ-IARACTZ3IS7ICS : 

(A) LENGTH: 541 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NC 
(ill) ANT I - SENSE : NC 



(vii) IMMEDIATE SCURCE : 

(5) CLONE : 5E3 3-1-I0 



(ix) FEATURE: 

(A) NAME /KEY : 02 S 

(B) LOCATION: 2 . .541 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



0 GTC GGC GCT CCC GTA GC 
Val Gly Ala Pre Val GI 
1 5 



GGC GTC GCA AGA GCC CTT GOG CAT GGC 
Gly Val Ala Arg Ala Leu Ala His Gly 
10 15 



4 = 



GTG AGG GCC CTT GAG GAC 
Val Arg Ala Leu Glu Asp 
20 



GGG ATA AAC TTC GCA ACA GGG AAT TTG CCC 
Gly lie Asn Phe Ala Thr Gly Asn Leu Pro 
25 30 



GGT TGC TCC TTT TOT AT: 
Gly Cys Ser Phe Ser 11= 
3 5 



CAT CCA GCA GCT GGT 

His Pro Ala Ala Gly 
50 

CTT ACC AAC GAC TGT 

Leu Thr Asn Asp Cys 
65 



CT-- 
Le- 



Ser 



GTT ATT CTG CAC GCG CCC 
Val lie Leu His Ala Pre 
80 8 5 



TTC CTT CTT GCT CTG TTC TCT TGC TTA ATC 
Phe Leu Leu Ala Leu Phe Ser Cys Leu lie 
40 45 

GAG TGG CGG AAT ACG TCT GGC CTC TAT GTC 
Glu Trp Arg Asn Thr Ser Gly Leu Tyr Val 
55 60 

AA7 AGT AGT ATT GTG TAT GAG GCC GAT GAC 
Asn Ser Ser lie Val Tyr Glu Ala Asp Asp 

7 0 7 5 

GGC TGT GTA CCT TGT GTC CAG GAC GGC AAT 
Gly Cys Val Pro Cys Val Gin Asp Gly Asn 
90 95 



142 



150 



233 



236 
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ACG TCT ACA TGC TGG ACC 
Thr Ser Thr Cys Trp Thr 

100 

GTC GGG GCA ACC ACC GCT 
Val Gly Ala Thr Thr Ala 
115 

GGC GCG GCC ACG ATG TGC 
Gly Ala Ala Thr Mec Cys 
130 

GCC GTC TTC CTC GTG GGA 
Ala Val Phe Leu Val Gly 
145 



114 

CCA GTA ACA CCT 
Pro Val Thr Pro 
105 

TCG ATA CGC AGT 
Ser lie Arg Ser 
120 

TCT GCG CTT TAC 
Ser Ala Leu Tyr 

135 

CAA GCC TTC ACG 
Gin Ala Phe Thr 

150 
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ACA GTG GCA GTC AGG TAC 33 4 

Thr Val Ala Val Arg Tyr 
110 

CAT GTG GAC CTG TTA GTA 38 2 

His Val Asp Leu Leu Val 
125 

GTG GGT GAT ATG TGT GGG 43 0 

Val Gly Asp Mec Cys Gly 
140 

TTC AGA CCC CGC CGC CAT 4 78 

Phe Arg Pro Arg Arg His 
155 



ffs 



CAA ACG GTC CAG ACC TGT 
Gin Thr Val Glr. Thr Cys 
150 155 



AAC TGC TCG CTG 
Asn Cys Ser Leu 



TAC CCA GGC CAT CTT TCA 
Tyr Pro Gly His Leu Ser 
170 175 



GGA CAT CGC ATG 
Gly His Ar=r Met 



GCT 
Ala 

180 



(2) INFORMATION FCR SEQ ID NO : 24: 



525 



rn 

Vf ' 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 190 amine acicis 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val 
15 10 15 



Arg Ala Leu Glu Asp Gly lie Asn Phe Ala Thr Gly Asn Leu Pro Glv 

20 25 30 

Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Phe Ser Cys Leu lie His 

' 35 40 45 

Pro Ala Ala Gly Leu Glu Trp Arg Asn Thr Ser Gly Leu Tyr Val Leu 

50 55 60 

Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala Asp Asp Val 

65 70 75 ao 

lie Leu His Ala Pro Gly Cys Val Pro Cys Val Gin Asp Gly Asn Thr 

6 5 90 95 

Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val Ala Val Arg Tyr Val 

100 105 110 



Gly Ala Thr Thr Ala Ser lie Arg Ser His Val Aso Leu Leu Val Glv 
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115 120 125 

Ala Ala Thr Mec Cys Ser Ala Leu Tyr Val Gly Asp Met: Cys Gly Ala 
130 135 140 

Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg Pro Arg Arg His Gin 
145 150 155 ISO 

Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser Gly 
155 170 175 

His Arg Met Ala 
130 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 
(3) TYPE : nucleic acid 

(C) STRANDEDNZ33 : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL : NC 
(iii) ANTI-SENSE: NC 



(vii) IMMEDIATE SOURCE: 

(3) CLONE: 3R33-I-19 

(:x) FEATURE : 

(A) NAME /KEY : CDS 
(3) LOCATION: 2. .541 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NC : 25: 

C GTC GGC GCT CCC GTA GGA GGC GTC GCA AGA GCC CTT GCG CAT GGC 
Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly 
1 5 10 15 

GTG AGG GCC CTT GAG GAC GGG ATA AAC TTC GCA ACA GGG AAT TTG CCC 
Val Arg Ala Leu Glu Asp Gly lie Asn Phe Ala Thr Gly Asn Leu Pro 
20 25 30 

GGT TGC TCT TTT TCT ATC TTC CTT CTT GCT CTG TTC TCT TGC TTA ATC 
Glv Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Phe Ser Cys Leu lie 
35 40 45 

CAT CCA GCA GCT GGT CTA GAG TGG CGG AAT ACG TCT GGC CTC TAT GTC 
His Pro Ala Ala Gly Leu Glu Trp Arg Asn Thr Ser Gly Leu Tyr Val 
50 55 SO 

CTT ACC AAC GAC TGT TCC AAT AGT AGT ATT GTG TAT GAG GCC GAT GAC 
Leu Thr Asn Aso Cys Ser Asn Ser Ser He Val Tyr Glu Ala As? As? 
65 ' 70 75 
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GTT ATT 
Val lie 
80 



CTG CAC GCG 
Leu His Ala 



CCC GGC TGT GTA CCT TGT GTC 
Pro Gly Cys Val Pro Cys Val 
35 90 



GAG GAC 
Gin Aso 



GGC AAT 
Gly Asn 
95 



286 



ACG TCT ACA TGC TGG ACC CCA GTA ACA CCT ACA GTG 
Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val 
100 105 



GCA GTC 
Ala Val 



AGG TAC 
Arg Tyr 
110 



334 



GTC GGG GCA ACC ACC GCT TCG ATA CGC AGT CAT GTG 
Val Gly Ala Thr Thr Ala Ser lie Arg Ser His Val 
115 120 



GAC CTG 
Asp "Leu 
125 



TTA GTA 
Leu Val 



382 



GGC GCG 
Gly Ala 



GCC ACG ATG 
Ala Thr Met: 
130 



TGC 
Cys 



TCT 
Ser 



GCG 
Ala 
135 



CTT TAC GTG GGT 
Leu Tyr Val Gly 



GAT ATC 
Asp Me: 
140 



TGT GGG 
Cys Gly 



430 



GCC GTC 
Ala Val 
145 



TTC CTC GTG 
Phe Leu Val 



GGA 
Gly 



CAA 
Gin 
150 



GCC TTC ACG TTC AGA CCC CGC 
Ala Phe Thr Phe Ara Pro Arc: 



CGC CAT 
Arg Kis 



478 



A ACC 



Gin Thr 

150 



GTC CAG ACC 
Val Gin Thr 



TGT 
Cys 
155 



AAC 
Asn 



TCG CTG TAC C 



■ TV JAJ 



C CAT CTT TCA 



Cys Ser Leu Tyr Pro Gly Kis Leu Ser 
170 175 



GGA CAT 

Gly His 



CGA ATG GCT 
Arg Me: Ala 
180 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: proeein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 26: 

Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val 
1 5 10 15 

Arg Ala Leu Glu Asp Gly lie Asn Phe Ala Thr Gly Asn Leu Pro Gly 
20 25 30 

Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Phe Ser Cys Leu He His 
35 40 45 

Pro Ala Ala Gly Leu Glu Trp Arg Asn Thr Ser Gly Leu Tyr Val Leu 
50 55 so 

Thr Asn Asp Cys Ser Asn Ser Ser lie Val Tyr Glu Ala Asp Asp Val 
65 70 75 80 

He Leu His Ala Pro Gly Cys Val Pro Cys Val Gin Asp Gly Asn Thr 
85 90 S5 
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Ser Thr Cys Trp Thr Pro Val Thr Pro TSir Val Ala Val Arg Tyr Val 

100 105 no 

Gly Ala Thr Thr Ala Ser He Arg Ser His Val Asp Leu Leu Val Gly 

115 120 125 

Ala Ala Thr Mec Cys Ser Ala Leu Tyr Val Gly Asp Met Cys Gly Ala 

130 135 140 



Val Phe Leu Val Gly Glr. Ala ?he Thr Phe Arg Pro Arg Arg His Gin 
145 150 155 160 

Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser Gly 



155 



170 



175 



His Arg Mec Ala 
180 

(2) INFORMATION FOR SEQ ID NO: 27: 



m 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 
(3) TYPE: nucleic acid 

(C) STRA:;DEDNE2S : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYRE : cDNA 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(3) CLONE: BR33-1-20 
Cix) FEATURE: 

(A) NAME /KEY: CDS 
(3) LOCATION: 2 . .541 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

C GTC GGC GCT CCC GTA GGA GGC GTC GCA AGA GCC CTT GCG CAT GGC 4 c 

Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly 
1 5 10 15 

GTG AGG GCC CTT GAG GAC GGG ATA AAC TTC GCA ACA GGG AAT TTG CCC 94 
Val Arg Ala Leu Glu As? Gly lie Asn Phe Ala Thr Gly Asn Leu Pro 
20 25 30 

GGT TGC TCT TTT TCT ATC TTC CTT CTT GCT CTG TTC TCT TGC TTA ATC 14 2 

Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Phe Ser Cys Leu He 
35 40 45 

CAT CCA GCA GCT GGT CTA GAG TGG CGG AAT ACG TCT GGC CTC TAT GTC 130 
His Pro Ala Ala Gly Leu Glu Trp Arg Asn Thr Ser Gly Leu Tyr Val 
50 55 60 
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CTT ACC AAC GAC TGT TCC AAT AGT 
Leu Thr Asn Asp Cys Ser Asn Ser 
65 70 



AGT 
Ser 



ATT 
lie 



GTG TAT GAG GCC 
Val Tyr Glu Ala 
75 



GAT GAC 
Aso Asc 



233 



GTT ATT CTG CAC GCG CCC GGC TGT GTA CCT 

Val He Leu His Ala Pro Gly Cys Val Pro 
80 as 



TGT GTC CAG GAC 
Cys Val Gin Asp 
90 



GGC AAT 
Gly Asn 
95 



28o 



AC G TCT ACA TGC 
Thr Ser Thr Cys 



CCA GTA 
Pro Val 



ACA 
Thr 



100 



CCT 
Pro 
105 



ACA GTG GCA GTC 
Thr Val Ala Val 



AGG TAC 
Arg Tyr 
110 



334 



GTC GGG 
Val Glv 



GCA ACC 
Ala Thr 
115 



ACC 
Thr 



GCT 
Ala 



TCG ATA 
Ser He 



CGC 
Arg 
120 



AGT CAT GTG GAC CTG TTA GTA 
Ser His Val Asp Leu Leu Val 
125 



332 



GGC GCG GCC ACG ATG TGC TCT GCG CTT TAC GTG GGT GAT ATG 
Gly Ala Ala Thr Me- Cys Ser Ala Leu Tyr Val Gly Asp Men 
130 135 140 



TGT GGG 
Cys Glv 



430 



GCC GTC 
Ala Val 
145 



TTC CTC 
Phe Leu 



GTG 
Val 



GGA 
Glv 



CAA GCC 
Gin Al a 
150 



TTC ACG TTC AGA CCC CGC CGC CAT 
Phe Thr Phe Arg Pro Arg Arg His 
155 



CAA ACG GTC CAG ACC TGT AAC TGC TCG CTG TAC CCA GGC CAT CTT TCA 
Gin Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser 
150 165 170 175 



GGA CAT 
Glv His 



CGA ATG 
Arg Men 



GCT 
Ala 
130 



541 



(2) INFORMATION FOP. SZQ ID NO: 23: 



tar 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 180 amino acids 
(3) TYPE : amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 28: 

Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val 

15 10 15 

Arg Ala Leu Glu Asp Gly lie Asn Phe Ala Thr Gly Asn Leu Pro Gly 

20 25 30 



Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Phe Ser Cys Leu 
35 40 45 



lie His 



Pro Ala Ala Gly Leu Glu Trp Arg Asn Thr Ser Gly Leu Tyr 
50 55 60 



Val Lei 



Thr Asn Asp Cys Ser Asn Ser Ser lie Val Tyr Glu Ala Asp Asp Val 
65 70 75 80 
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lie Leu His Ala Pro Gly Cys Val Pro Cys Val Gin Asp Gly Asn Thr 
85 90 95 

Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val Ala Val Arg Tyr Val 

100 105 no 

Gly Ala Thr Thr Ala Ser lie Arg Ser His Val Asp Leu Leu Val Gly 
115 120 125 

Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly Asp Met Cys Gly Ala 
130 135 140 

Val Phe Leu Val Gly Gin Ala ?he Thr Phe Arg Pro Arg Arg His Gin 
145 150 155 160 

Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser Gly 
155 170 175 



-JJS3- 



His Arg Men Ala 
130 

(2) INFORMATION FOR SEQ 10 NO: 29: 



yj 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 base pairs 
(3) TYPE : nucleic acid 

(C) S TRANOEDNES S : single 

(D) TOPOLOGY: linear 



m 



U1 

o 



(ii) MOLECULE TYPE: cDN~ 
(ill) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: HCC115 3 



(ix) FEATURE: 

(A) NAME / KEY : COS 
(3) LOCATION: 3 . .237 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

TA GAC TTT TGG GAG AGO GTC TTC ACT GGA CTA ACT CAC ATA GAT GCC 4 7 

Asp Phe Trp Glu Ser Val Phe Thr Gly Leu Thr His He Asp Ala 
15 10 15 

CAC TTT CTG TCA CAG ACT AAG CAG CAG GGA CTC AAC TTC TCG TTC CTG 9 5 

His Phe Leu Ser Gin Thr Lys Gin Gin Gly Leu Asn Phe Ser Phe Leu 

2 0 2 5 3 0 

ACT GCC TAC CAA GCC ACT GTC- TGC GCT CGC GCG CAG GOT CCT CCC CCA 14 3 

Thr Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro Pro Pro 

35 40 45 

AGT TGG GAC GAG ATG TGG AAG TGT CTC GTA CGG CTT AAG CCA ACA CTA 191 



SUBSTITUTE SHEET (RULE 26) 



WO 94£5i 



120 




PCT/EP94/01323 



Ser Trp Asp Glu Met: Trp Lys Cys Leu Val Arg Leu Lys Pro Thr Leu 
50 55 60 



CAT GGA CCT ACG CCT CTT CTA TAT CGG TTG GGG CCT GTC CAA AAT GAA 
His Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro Val Gin Asn Glu 
65 70 75 



239 



ATC TGC TT 3 ACA CAT CCC ATC ACA AAA TnC AT— ATG G—A TGC ATj TCA 
lie Cys Leu Thr His Pro lie Thr Lys Tyr He Mec Ala Cys Met Ser 
80 35 90 95 



297 



(2) INFORMATION FOR SEQ ID NO : 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : crocein 

fg (xi) SEQUENCE DESCRIPTION : SEQ ID NC : 3C: 

\i 

Aso Phe Trp Glu Ser Val Phe Thr Gly Leu Thr His lie Asp Ala His 
S 1*5 10 15 

n f Phe Leu Ser Gin Thr Lys Gin Gin Gly Leu Asn Phe Ser Phe Leu Thr 

4« 2 0 2 5 3C 

O Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro Pro Pro Ser 

111 35 40 45 

Iff Trp Asp Glu Met Trp Lys Cys Leu Val Arg Leu Lys Pro Thr Leu His 

5 0 5 5 6 0 

Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro Val Gin Asn Glu lie 
65 70 75 80 

Cys Leu Thr His Pro lie Thr Lys Tyr He Met Ala Cys Met Ser 
85 90 95 

(2) INFORMATION FOR SEQ ID NO : 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: NO 
(ill) ANTI -SENSE : NO 



(vii) IMMEDIATE SOURCE : 

(3) CLONE: HD10-1-25 
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(ix) FEATURE : 

(A) NAME / KEY : CDS 

(B) LOCATION: 3.. 401 



(xii SEQUENCE DESCRIPTION: SEQ ID NO : 31: 

TC CAA AAT GAA ATC TGC TTG ACA CAC CCC GTC ACA AAA TAC ATT ATG 4 
Gin Asn Glu He Cys Leu Thr His Pro Val Thr Lys Tyr He Mec 
15 10 15 

GCA TGC ATG TCA GCT GAT CTG GAA GTA ACC ACC AGC ACC TGG GTG TTG 9 
Ala Cys Met Ser Ala As? Leu Glu Val Thr Thr Ser Thr Trp Val Leu 
20 25 30 

CTT GGA GGG GTC CTC GCG GCC CTA GCG GCC TAC TGC TTG TCA GTC GGC 14 
Leu Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly 
35 4G 45 

TGC GTT GTA ATC GTG GGT CAT ATC GAG CTG GGG GGC AAG CCG GCA CTC 19 
Cys Val Val lie Val Gly His He Glu Leu Gly Gly Lys Pro Ala Leu 
50 55 60 

GTT CCA GAC AAG GAG GTG TTG TAT CAA CAG TAC GAT GAG AiG GAG GAG 2 3 

Val Pro Asp Lys Glu Val Leu Tyr Gin Gin Tyr Asp Glu Me: Glu Glu 
65 70 75 

TGC TCG CAA GCC GCC CCA TAC ATC GAA CAA GCT CAG GTA ATA GCC CAC 2S 
Cys Ser Gin Ala Ala Pro Tyr He Glu Gin Ala Gin Val He Ala His 

80 85 90 95 

CAG TTC AAG GAG AAA ATC CTT GGA CTG CTG CAG CCA GCC ACC CAA CAA 3 3 

Gin Phe Lys Glu Lys He Leu Gly Leu Leu Gin Arg Ala Thr Gin Gin 
100 105 110 

CAA GCT GTC ATT GAG CCC GTA ATA GCT TCC AAC TGG CAA AAG CTT GAA 3 3 

Gin Ala Val He Glu Pre Val He Ala Ser Asn Trp Gin Lys Leu Glu 
115 120 125 

ACC TTC TGG CAC AAG CAT 4 0 

Thr Phe Trp His Lys His 
130 



(2} INFORMATION FOR SEQ ID NO : 32: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : procein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Gin Asn Glu He Cys Leu Thr His Pro Val Thr Lys Tyr He Met Ala 
15 10 15 
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Cys Met Ser Ala Asp Leu Glu Val Thr Thr Ser Thr Trp Val Leu Leu 
20 25 30 

Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly Cys 
3S 40 45 

Val Val lie Val Gly His lie Glu Leu Gly Gly Lys Pro Ala Leu Val 
50 55 60 

Pro Asp Lys Glu Val Leu Tyr Gin Gin Tyr Asp Glu Mec Glu Glu Cys 
65 70 75 80 

Ser Gin Ala Ala Pro Tyr He Glu Gin Ala Gin Val He Ala His Gin 
35 90 95 

Phe Lys Glu Lys lie Leu Gly Leu Leu Gin Arg Ala Thr Gin Gin Gin 

ICO 105 no 

Ala Val He Glu Pre Val He Ala Ser Asn Trp Gin Lys Leu Glu Thr 

115 120 125 

Phe Trp His Lys His 
130 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 401 base pairs 
(3) TYPE : nucleic acic 

(C) STRAND EDNE3S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL : NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE : 

(B) CLONE: HD 10-1-3 

(ix) FEATURE : 

(A) NAME / KEY : CDS 

(B) LOCATION: 3 . .401 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

TC CAA AAT GAA ATC TGC TTG ACA CAC CCC GTC ACA AAA TAC ATT ATG 4 7 

Gin Asn Glu He Cys Leu Thr His Pro Val Thr Lys Tyr He Met 
1 5 10 15 

GCA TGC ATG TCA GCT GAT CTG GAA "GTA ACC ACC AGO ACC TGG GTG TTG 9 5 

Ala Cys Met Ser Ala Asp Leu Glu Val Thr Thr Ser Thr Trp Val Leu 
20 25 30 

CTT GGA GGG GTC CTC GCG GCC CTA GCG GCC TAC TGC TTG TCA GTC GGC 14 3 
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Leu Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly 
35 40 45 

TGC GTT GTA ATC GTG GGT CAT ATC GAG CTG GGG GGC AAG CCG GCA CTC 
Cys Val Val lie Val Gly His lie Glu Leu Gly Gly Lys Pro Ala Leu 
50 55 60 

GTT CCA GAC AAG C-AG GTG TTG TAT CAA CAG TAC GAT GAG ATG GAG GAG 
Val Pro Asp Lys Glu Val Leu Tyr Gin Gin Tyr Asp Glu Mec Glu Glu 
65 70 75 

TGC TCG CAA GCC GCC CCA TAC ATC GAA CAA GCT CAG GTA ATA GCC CAC 
Cys Ser Gin Ala Ala Pro Tyr He Glu Gin Ala Gin Val He Ala His 
80 35 90 95 

CAG TTC AAG GAG AAA ATC CTT GGA CTG CTG CAG CGA GCC ACC CAA CAA 
Gin Phe Lys Glu Lys lie Leu Gly Leu Leu Gin Arg Ala Thr Gin Gin 

100 105 HO 

CAA GCT GTC ATT GAG CCC GTA ATA GCT TCC AAC TGG CAA AAG CTT GAA 
Gin Ala Val He Glu Pro Val He Ala Ser Asn Trp Gin Lys Leu Glu 
115 120 1C5 



191 



335 



3S: 



ACC TTC TGG CAC AAG CAT 
Thr Phe Trp His Lys His 
130 



INFORMATION FOR SEQ -3 NO : 34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13 3 amine acids 
(3) TYPE : arcino acid 
(D) TO PC LOGY : linear 



(ii) MOLECULE TYPE : procein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Gin Asn Glu lie Cys Leu Thr His Pro Val Thr Lys Tyr He Met Ala 
1 ' 5 10 15 

Cys Met Ser Ala Asp Leu Glu Val Thr Thr Ser Thr Trp Val Leu Leu 
20 25 30 

Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly Cys 
35 40 45 

Val Val lie Val Gly His He Glu Leu Gly Gly Lys Pro Ala Leu Val 
50 55 60 

P-o Aso Lys Glu Val Leu Tvr Gin Gin Tyr Aso Glu Met Glu Glu Cys 
65 * 70 75 80 

Ser Gin Ala Ala Pro Tyr He Glu Gin Ala Gin Val He Ala His Gin 
85 90 95 

Phe Lys Glu Lys He Leu Gly Leu Leu Gin Arg Ala Thr Gin Gin Gin 
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105 



Ala Val lie Glu Pro Val He Ala Ser Asn Trp Gin Lys Leu Glu Thr 
115 120 12 s 

Phe Trp His Lys His 
130 

(2) INFORMATION rOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL : NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(3) CLONE: 3R36-20-164 

(A) NAME /KEY: CDS 

(B) LOCATION: 3 . .401 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 35: 

TC CAA AAT GAA ATC TGC TTG ACA CAC CCC ATC ACA AAA TAC ATC ATG 4 7 

Gin Asn Glu lie Cys Leu Thr His Pro He Thr Lys Tyr Il« Me*- 

1 5 10 " ~ " 15 

GCA TGC ATG TCA GCT GAT CTG GAA GTA ACC ACC AGO ACC TGG GTT TTG 9 5 

Ala Cys Met Ser Ala Asp Leu Glu Val Thr Thr Ser Thr Trp Val Leu 
20 25 " 30 

CTT GGA GGG GTC CTC GCG GCC CTA GCG GCC TAC TGC TTG TCA GTC GGT 14 3 

Leu Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly 
35 40 45 

TGT GTT GTG ATT GTG GGT CAT ATC GAG CTG GGG GGG AAG CCG GCA ATC 191 

Cys Val Val He Val Gly His He Glu Leu Gly Gly Lys Pro Ala He 
50 55 50 



GTT CCA GAC AAA GAG GTG TTG TAT CAA CAA TAC GAT GAG ATG GAA GAG 
Val Pro Asp Lys Glu Val Leu Tyr Gin Gin Tyr Asp Glu Met Glu Glu 
65 70 75 



!39 



TGC TCA CAA GCT GCC CCA TAT ATC GAA CAA GCT CAG GTA ATA GCT CAC 23 7 

Cys Ser Gin Ala Ala Pro Tyr He Glu Gin Ala Gin Val He Ala His 

8 0 85 90 95 

CAG TTC AAG GGA AAA GTC CTT GGA TTG CTG CAG CGA GCC ACC CAA CAA 33 5 
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Gin Phe Lys Gly Lys Val Leu Gly Leu Leu Gin Arg Ala Thr Gin Gin 
100 105 no 



CAA GCT GTC ATT GAG CCC ATA GTA ACT ACC AAC TGG CAA AAG CTT GAG 
Gin Ala Val Xle Glu Pro He Val Thr Thr Asn Trp Gin Lys Leu Glu 
115 120 125 



333 



GCC TTT TGG CAC AAG CAT 
Ala Phe Trp His Lys His 
130 



401 



(2) INFORMATION FOR SEQ ID NO : 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 13 3 ammo acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3o : 

Gin Asn Glu He Cys Leu Thr His Pre He Thr Lvs Tvr He Me: Ala 
1 5 10 " * is 



Cys Met: Ser Ala Asp Leu Glu Val Thr Thr Ser Thr Tro 



va_ i^eu Le\ 



Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly Cys 
35 40 45 

Val Val He Val Gly His He Glu Leu Gly Gly Lys Pro Ala He Val 
50 55 " 60 

Pro Asp Lys Glu Val Leu Tyr Gin Gin Tyr Asp Glu Mec Glu Glu Cvs 
65 70 75 30 

Ser Gin Ala Ala Pro Tyr He Glu Gin Ala Gin Val He Ala His Gin 
85 90 95 

Phe Lys Gly Lys Val Leu Gly Leu Leu Gin Arg Ala Thr Gin Gin Gin 
100 105 no 

Ala Val He Glu Pro He Val Thr Thr Asn Trp Gin Lys Leu Glu Ala 
115 120 125 

Phe Trp His Lys His 
130 

(2) INFORMATION FOR SEQ ID NO : 37: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 401 base pairs 
(3) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE : 

(3) CLONE: 3R36-20-I66 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 3.. 401 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

TC CAA AAT GAA ATC TGC TTG ACA CAC CCC ATC ACA AAA TAC ATC ATG ^ 4 7 

Gin Asn Glu He Cys Leu Thr His Pro He Thr Lys Tyr He Me: 
Q 1*5 10 15 

m GCA TGC ATG TCA GCT GAT CTG GAA GTA ACC ACC AGO ACC TGG GTT TTG 

%j Ala Cys Met Ser Ala Asp Leu Glu Val Thr Thr Ser Thr Trp Val Leu 

Ui 20 25 30 



GCC TTT TGG CAC AAG CAT 
Ala Phe Trp Ha.s Lys His 
130 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 133 amino acids 



95 



143 



CTT GGA GGG GTC CTC GCG GCC CTA GCG GCC TAC TGC TTG TCA GTC GGT 
\2 Leu Giy Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly 

^ 35 40 4 5 

5 TGT GTT GTG ATT GTG GGT CAT ATC GAG CTG GGG GGC AAG CCG GCA ATC 19 

0** Cys Val Val He Val Gly His He Glu Leu Gly Gly Lys Pro Ala He 

O 5 0 5 5 6 0 



239 



23' 



GTT CCA GAC AAA GAG GTG TTG TAT CAA CAA TAC GAT GAG ATG GAA GAG 
Val Pro Asp Lys Glu Val Leu Tyr Gin Gin Tyr Asp Glu Mec Glu Glu 
65 70 75 

TGC TCA CAA GCT GCC CCA TAT ATC GAA CAA GCT CAG GTG ATA GCT CAC 
Cys Ser Gin Ala Ala Pro Tyr He Glu Gin Ala Gin Val He Ala His 
80 85 90 95 

CAG TTC AAG GAA AAA GTC CTT GGA TTG CTG CAG CGA GCC ACC CAA CAA 33 5 

Gin Phe Lys Glu Lys Val Leu Gly Leu Leu Gin Arg Ala Thr Gin Gin 
100 105 HO 

CAA GCT GTC ATT GAG CCC ATA GTA ACT ACC AAC TGG CAA AAG CTT GAG 38 3 

Gin Ala Val He Glu Pro He Val Thr Thr Asn Trp Gin Lys Leu Glu 
115 120 125 



401 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 38: 

Gin Asn Glu lie Cys Leu Thr His Pro lie Thr Lys Tyr He Met: Ala 
1 5 10 15 

Cys Met Ser Ala As? Leu Glu Vai Thr Thr Ser Thr Trp Val Leu Leu 

2 0 25 3 0 

Gly Giy Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly Cys 
35 40 45 

Val Val He Val Giy His lie Glu Leu Giy Gly Lys Pro Ala He Vai 
50 55 60 

Pro Asp Lvs Glu Val Leu Tyr Gin Gin Tyr Asp Glu Me: Glu Glu Cys 
65 70 75 30 

Ser Gin Ala Ala Pro Tyr He Glu Gin Ala Gin Val He Ala His Gin 
85 90 95 

Phe Lys Glu Lys Vai Leu Gly Leu Leu Gin Arg Ala Thr Gin Gin Gin 
100 105 HO 

Ala Val He Glu Pro He Val Thr Thr Asn Trp Gin Lys Leu Glu Ala 
115 120 125 

Phe Trp His Lys His 
130 

(2} INFORMATION FOR SEC ID NC : 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 base pairs 
{B) TYPE : nucleio acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE : 

(3) CLONE: BR35-20-155 

(ix) FEATURE: 

(A) NAME /KEY: CDS 
(3) LOCATION: 3 . .401 



(xi) SEQUENCE DESCRIPTION* : SEQ ID NO: 39: 
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TC CAA AAT GAA ATC TGC TTG ACA CAC CCC ATC ACA AAA TAC ATC ATG 
Gin Asn Glu lie Cys Leu Thr His Pro lie Thr Lys Tyr lie Mec 
15 10 is 



47 



GCA TGC ATG TCA GCT GAT CTG GAA GTA ACC ACC AGC ACC TGG GTT TTG 
Ala Cys Met Ser Ala Asp Leu Glu Val Thr Thr Ser Thr Trp Val Leu 
20 25 " 30 



95 



CTT GGA GGG GTC CTC GCG GCC CTA GCG GCC TAC TGC TTG TCA GTC GGT 
Leu Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly 



40 



143 



TGT GTT GTG ATT GTG GGT CAT ATC GAG CTG GGG GGC AAG CCG GCA ATC 
Cys Val Val lie Val Gly His lie Glu Leu Gly Gly Lys Pro Ala lie 
50 55 60 



191 



GTT CCA GAC AAA GAG GTG TTG TAT CAA CAA TAC GAT GAG ATG GAA GAG 
Val Pro Asp Lys Glu Val Leu Tyr Gin Gin Tyr Asp Glu Me- Glu Glu 
65 70 75 



239 



TGC TCA CAA GCT GCC CCA TAT ATC GAA CAA GCT CAG GTA ATA GCT CAC 
Cys Ser Gin Ala Ala Pro Tyr He Glu Gin Ala Gin Val lie Ala His 
80 85 90 95 



CAG TTC AAG GAA AAA GTC CTT GGA TTG CTG CAG CGA GCC ACC CAA CAA 
Gin ?he Lys Giu Lys Val Leu Gly Leu Leu Gin Arg Ala Thr Gin Gin 
100 105 no 



335 



CAA GCT GTC ATT GAG CCC ATA GTA ACT ACC AAC TGG CAA AAG CTT GAG 
Gin Ala Val lie Giu Pro lie Val Thr Thr Asn Trp Gin Lys Leu Giu 
115 120 125 



33: 



GCC TTT TGG CAC AAG CAT 
Ala Phe Trp His Lys Kis 
130 



401 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 13 3 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Gin Asn Glu He Cys Leu Thr His Pro He Thr Lys Tyr He Met Ala 
15 10 15 

Cys Met Ser Ala Asp Leu Giu Val Thr Thr Ser Thr Trp Val Leu Leu 
20 25 30 

Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly Cys 
35 40 45 
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Val Val He Val Gly His He Glu Leu Gly Gly Lys Pro Ala He Val 
50 55 60 

Pro Asp Lys Glu Val Leu Tyr Gin Gin Tyr Asp Glu Met Glu Glu Cys 
65 70 75 80 

Ser Gin Ala Ala Pro Tyr He Glu Gin Ala Gin Val He Ala His Gin 
85 90 9= 

Phe Lys Glu Lys Val Leu Gly Leu Leu Gin Arg Ala Thr Gin Gin Gin 
100 105 HO 

Ala Val He Glu Pro He Val Thr Thr Asn Trp Gin Lys Leu Glu Ala 
115 120 1-5 

Phe Trp His Lys His 
130 



-is? 

SiKSS. 

5 S £ 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 509 base pairs 
(3) TYPE: nucleic acid 

(C) STRANCEENES3 : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cCNA 
(ill) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



□ 

(vii) IMMEDIATE SOURCE : 
(B) CLONE: FC-2-1 

(ix) FEATURE: 

(A) NAME /KEY: CCS 
(3) LOCATION: 3 . - 50 9 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CC ATG AGC ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA AGA AAC ACC 4 7 

M»t S*- Thr Asn Pro Lvs Pro Gin Arg Lys Thr Lys Arg Asn Thr 
15 10 15 



AAC CGT CGC CCA CAG GAC GTC AAG TTC CCG GGC GGT GGT CAG ACC GTT 
Asn Arg Arg Pro Gin Aso Val Lys Phe Pro Gly Gly Gly Gin He Val 
20 25 30 

GGC GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGG ATG GGT GTG CGC 
Giv Glv Val Tvr Leu Leu Pro Arg "Arg Gly Pro Arg Met Gly Val Arg 
35 40 45 

GCG ACT CGG AAG ACT TCG GAA CGG TCG CAA CCC CGT GGA CGG CGT CAG 
Ala Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin 



95 



191 
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SO 55 60 

CCT ATT CCC AAG GCG CGC CAG CCC ACG GGC CGG TCC TGG GGT CAA CCC 23 9 

Pro lie Pro Lys Ala Arg Gin Pro Thr Gly Arg Ser Trp Gly Gin Pro 
65 70 75 

GGG TAC CCT TGG CCC CTT TAC GCC AAT GAG GGC CTC GGG TGG GCA GGG 287 
Gly Tyr Pro Trp Pro Leu Tyr Ala Asn Glu Gly Leu Gly T:ro Ala Glv 
SO 85 90 95 

TGG CTG CTC TCC CCT CGA GGC TCT CGG CCT AAT TGG GGC CCC AAT GAC 33 5 

Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro Asn Trp Gly Pro Asn As? 

100 105 no 

CCC CGG CGA AAA TCG CGT AAT TTG GGT AAG GTC ATC GAT ACC CTA ACG 3 83 

Pro Arg Arg Lys Ser Arg Asn Leu Gly Lys Val lie Asp Thr Leu Thr 
115 120 " 125 

TGC GGA TTC GCC GAT CTC ATG GGG TAT ATC COG CTC GTA GGC GGC CCC " 431 
Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Gly Pro 
130 135 140 

ATT GGG GGC GTC GCA AGG GCT CTC GCA CAC GGT GTG AGG GTC CTT GAG 4 79 

He Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu 
145 150 155 

GAC GGG GTA AAC TAT GCA ACA GGG AAT TTA 50 9 

Asp Gly Val Asn Tyr Ala Thr Gly Asn Leu 
160 155 



(2) INFORMATION FOR 5EQ 10 NO: 42: 

(i) SEQUENCE CHAPACTEHISTICS : 
(A) LENGTH: 15 9 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Met: Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

lie Pro Lys Ala Arg Gin Pro Thr Gly Arg Ser Trp Gly Gin Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Ala Asn Glu Gly Leu Gly Trp Ala Gly Trp 
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85 90 



95 



Leu Leu Ser Pro Arg Gly Ser Arg Pro Asn Trp Gly Pro Asn Asp Pro 

100 io5 no 

Arg Arg Lys Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 

115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Gly Pro He 



L3Q 



140 



Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu As? 
145 ^0 155 " 16 q 

Gly Val Asn Tyr Ala Thr Gly Asn Leu 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHAaACTS?.ISTICS : 

(A) LENGTH: 50 9 base pairs 

(B) TYPE: nucleic acid 

(C) STSANEEI-NZSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cCNA 
(iii) ITf?C?KE"CAL: NO 
(-ii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 
{3} CLONE: PC-2-5 

<ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION : 3 . .509 



(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 43: 

CC ATG AGO ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA AGA AAC ACC 4 7 
Met Ser Thr Asn Pro Lys Pro Gin Arg Lvs Thr Lvs Arg Asn Tb- 

1 5 lo 15 

AAC CGT CGC CCA CAG GAC GTC AAG TTC CCG GGC GGT GGT CAG ATC GTT 9 5 

Asn Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie- Val 

20 25 30 

GGC GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGG ATG GGT GTG CGC 14 3 

Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Met Gly Val Arg 

35 40 45 

GCG ACT CGG AAG ACT TCG GAA CGG TCG CAA CCC CGT GGA CGG CGT CAG 191 

Ala Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin 

50 55 60 
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CCT ATT CCC AAG GGG CGC CAG 
Pro He Pro Lys Ala Arg Gin 
65 70 

GGG TAC CCT TGG CCC CTT TAC 
Gly Tyr Pro Trp Pro Leu Tyr 
30 85 

TGG CTG CTC TCC CCT CGA GGC 
Trp Leu Leu Ser Pro Arg Gly 
100 

CCC CGG CGA AAA TCG CGT AAT 
Pro Arg Arg Lys Ser Arg Asn 
115 



132 



PCT/EP94/0D23 



CCC 
Pro 



GCC 
Ala 



TCT 
Ser 



TTG 
Leu 



ACG GGC CGG TCC TGG GGT CAA CCC 23 9 

Thr Gly Arg Ser Trp Gly Gin Pro 
75 

AAT GAG GGC CTC GGG TGG GCA GGG 287 
Asn Glu Gly Leu Gly Trp Ala Gly 
90 * 95 

CGG CCT AAT TGG GGC CCC AAT GAC 335 
Arg Pro Asn Trp Gly Pro Asn Aso 
105 no 

GGT AAG GTC ATC GAT ACC CTA ACG 383 
Gly Lys Val He Asp Thr Leu Thr 
120 125 



TGC GGA TTC GCC GAT CTC ATG GGG 
Cys Gly Phe Ala Asp Leu Me: Gly 
130 135 



TAT ATC CCG CTC GTA GGC GGC CCC 
Tyr He Pro Leu Val Gly Gly Pro 
140 



431 



ATT GGG GGC GTC GCA AGG 
He Gly Gly Val Ala Arg 
145 



GCT 
Ala 

150 



CTC GCA CAC GGT GTG AGG GTC CTT GAG 
Leu Ala His Gly Val Arg Val Leu Glu 
153 



479 



GAC GGG GTA AAC TAT GCA ACA GGG AAT TTA 
Asp Gly Val Asn Tyr Ala Thr Gly Asr. Leu 
ISO ±6 3 



509 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH : 16 9 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
1 5 lo 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 



20 



25 



30 



Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Met Gly Val Arg Ala 

35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

50 55 go 

lie Pro Lys Ala Arg Gin Pro Thr Gly Arg Ser Trp Gly Gin Pro Gly 

65 70 75 80 



Tyr Pro Trp Pro Leu Tyr Ala Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 
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Leu Leu Ser Pro Arg Gly Ser Arg Pro Asn Trp Gly Pro Asn Asp Pro 
100 105 no 

Arg Arg Lys Ser Arg Asn Leu Gly Lys Val lie As? Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Leu Val Gly Gly Pro lie 
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 iso 



Gly Val Asn Tyr Ala Thr Gly Asn Leu 
155 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58G base pairs 

(B) TYPE: nucleic acid 

(C) ST2ANDEDNE33 : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: PC-4-1 

(ix) FEATURE: 

(A) NAME / KEY : CDS 
(3) LOCATION: 2 . . 530 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

A ACG TGC GGA TTC GCC GAT CTC ATG GGG TAT ATC CCG CTC GTA GGC 4 6 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly 
15 10 15 

GGC CCC ATT GGG GGC GTC GCA AGG GCT CTC GCA CAC GGT GTG AGG GTC 94 
Gly Pro lie Gly Gly Val Ala Arg Ala Leu Ala Kis Gly Val Arg Val 
20 25 30 • 

CTT GAG GAC GGG GTA AAC TAT GCA ACA GGG AAT TTA CCC GGT TGC TCT 14 2 

Leu Glu Asp Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser 
35 40 45 

TTC TCT ATC TTT ATT CTT GCT CTT "CTC TCG TGT CTG ACC GTT CCG GCC 190 
Phe Ser He Phe lie Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala 
50 55 60 

TCT GCA GTT CCC TAC CGA AAT GCC TCT GGG ATT TAT CAT GTT ACC AAT 23 8 
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Ser Ala Val Pro 
65 

GAT TGC CCA AAC 
Asp Cys Pro Aon 

80 

CAC GCA CCT GGT 
His Ala Pro Glv 



Tyr Arg 



TCT TCC 
Ser Ser 
85 

TGC GTG 
Cys Val 
100 



Asn 
70 

ATA 



CCT 
Pro 



Ala 



t34 

Ser Gly He 
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He T 



ACC CCT 
Pro 



T CGG 
Leu Arg 



TGC TGG GTC CAA 
Cys Trp Val Gin 
115 

GTC ACG GCT CCT 
Val Thr Ala Pro 
130 

GCC CTC TGC TCC GCG TTA 
Ala Leu Cys Ser Ala Leu 
145 

TTG GTA GGC CAA 
Leu Val Gly Gin 

160 

CAG AAC TGC AAC 
Gin Asn Cys Asn 



AGA 
Arg 



GTC 
Val 



TGT 
Cys 



ACA 
Thr 



GCC 
Ala 
135 



TAT GAG GCA 
Tyr Glu Ala 
90 

GTC ATG ACA 
Val Mec Thr 
105 

CTG TCA GCC 
Leu Ser Ala 
120 

GTT GAC TAC 
Val Asp Tyr 



Tyr His Val Thr 
75 

GAT AAC CTG ATC 
Asp Asn Leu He 



GGT AAT 
Gly Asn 



CCG AGC 
Pro Ser 



ATG TTC 
Met Phe 

15 5 

TGT TCC 
Cys Ser 
180 



TAC GTA 
Tyr Val 
150 

ACC TAT 
Thr Tyr 



ATT 
lie 



TAC 
Tyr 



GGA GAC GCG 
Gly Asd Ala 



AGG CCT CGC 
Arg Pro Arg 

170 

ACT GGC CAT 
Ser Gly Kis 
125 



CTA GCG 
Leu Ala 
140 

TGT GGG 

Cys Gly 
155 

CAG CAC 
Gin His 



GTG AGT 
Val Ser 
110 

CTC GGA 
Leu Gly 
125 

GGA GGG 
Gly Gly 



GCA CTA 
Ala Leu 



GCT ACG 
Ala Thr 



GTT ACC GGC CAC 
Val Thr Gly His 
130 



Asn 



CTA 
Leu 
95 

AGA 
Ara 



GCA 
Ala 



GCT 
Ala 



Phe 



Val 



CGG 
Arc 



2B6 



334 



3s: 



130 



: 73 



ATG GCA 
Met Ala 



53 0 



(2) INFORMATION -OR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 193 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 46: 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Gly 
1 5 io !5 

Pro He Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu 
20 25 30 

Glu As? Gly Vai Asn Tyr Ala Thr -Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

Ser He Phe lie Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 55 * 60 
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Ala Val Pro Tyr Arg Asn Ala Ser Gly lie Tyr His Val Thr Asn Asp 

65 70 75 80 

Cys Pro Asn Ser Ser He Val Tyr Glu Ala Asp Asn Leu He Leu His 

85 90 95 



Ala Pro Gly Cys Val Pro Cys Val Met 
100 105 



110 



Trp Val Gin lie Thr Pro Thr Leu Ser Ala Pro Ser Leu Gly Ala Val 
115 120 125 

Thr Ala Pro Leu Arg Arg Ala Val Asp Tyr Leu Ala Gly Gly Ala Ala 
130 135 14C 

Leu Cys Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe Leu 
145 150 135 ISO 

Val Gly Gin Met: Phe Thr Tyr Arg Pro Arg Gin His Ala Thr Val Gin 
165 170 175 

Asn Cys Asn Cys Ser He Tyr Ser Gly His Val Thr Gly His Arg Met: 
130 1S5 190 



Ala 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 0 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNE3S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: PC-4-S 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 2 . . 580 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 47: 

A ACG TGC GGA TTC GCC GAT CTC ATG GGG TAT ATC CCG CTC GTA GGC 
Thr Cys Gly Phe Ala Asp Leu Mec Gly Tyr He Pro Leu Val Gly 
15 10 15 

GGC CCC ATT GGG GGC GTC GCA AGG GCT CTC GCA CAC GGT GTG AGG GTC 
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Giy Pro He Gly Gly Val Ala Arg Ala Leu Ala His Glv Va' Arg Val 
20 25 3= 

C7T GAG GAC GGG GTA AAC TAT GCA ACA GGG AAT TTA CCC GGT TGC TCT 

Leu Glu Asp Glv Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser 

35 43 45 

TTC TCT ATC TTT ATT CTT GCT CTT CTC TCC- TG~ 



Phe Ser He Phe lie Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Al 



50 • 55 



TCT GCA GTT CCC TAC CGA AAT GCC TTT GGG ATT TAT CAT GTT ACC AAT 
Ser Axa Val Pro Tyr Arg Asn Ala Ser Gly Ue Tyr His Val Thr Asn 



S3 70 



/ 2 



GAT TGC CCA AAC TCT TCC ATA GTC TAT GAG GCA GAT AAC CTG ATC C^A 

Asp Cys Pro Asn Ser Ser lie Val Tyr Glu Ala Aso Asn Leu lie Leu 

85 90 9= 

CAC GCA CCT GGT TGC GTG CCT TGT GTC ATG ACA GGT AAT GTG AGT AGA 

h-s Ala Pro Gly Cys Val Pro Cys Val Me- Thr Glv Asr. Val Ser Arc 

100 105 ' " 110 

TGC TGG GTC CAA ATT ACC CCT ACA CTG TCA GCC CCG AGC CTC GGA GCA 

Cys Trp Val Gin He Thr Pro Thr Leu Ser Ala La- G-/ V a 
115 120 



L23 



f/r *f C ™ ^ CGG AGA GCC G ~ GAC TAC CTA GCC- GGA GGG GCT 

Vai Thr Axa Pro Leu Arg Arg Ala Val Aso Tvr Leu Ala Glv GW A 1 a 
130 135 140 ' " 



TTG GTA GGC CAA ATG TTC ACC TAT AGC- CCT CGC CAG CAC GCT ACG GTG 
Leu Val Gly Gin Me= Phe Thr Tyr Arg Pro Arg Gin His Ala Thr Vai 

1/C 175 

CAG AAC TGC AAC TGT TCC ATT TAC AGT GGC CAT GTT ACC GGC CAC CGG 
Gin Asn Cys Asn Cys Ser lie Tyr Ser Gly His Vai Thr Gly His Are 
130 135 * 19Q 

ATG GCA 
Met Ala 



14: 



•j — ^j.. CCG GCC 190 
a 



238 



28* 



334 



38; 



S GCC CTC TGC TCC GCG TTA TAC GTA GGA GAC GCG TGT GGG GCA CTA T^C 47; 

Z: Aia Leu Cys Ser Ala Leu Tyr Vai Giy As? Ala Cys Glv Ala Leu Phe 

145 150 15- 



S2S 



574 



580 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 193 ammo acids 
(3) TYPE: anino acid 
(D) TOPOLOGY : linear - 

( ii ) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
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Thr Cys Gly Phe Ala As? Leu Met Gly Tyr lie Pro Leu Val Gly Gly 
15 10 15 

Pro He Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu 
20 25 30 

Glu Asp Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

Ser He Phe He Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 55 60 

Ala Val Pro Tyr Arg Asn Ala Ser Gly He Tyr His Val Thr Asn Asp 
65 70 75 80 

Cys Pro Asn Ser Ser lie Val Tyr Glu Ala Asp Asn Leu He Leu His 
85 90 95 

Ala Pro Gly Cys Val Pro Cys Val Met Thr Gly Asn Val Ser Arg Cys 
100 105 110 

Trp Val Gin He Thr Pro Thr Leu Ser Ala Pro Ser Leu Gly Ala Val 
115 120 125 

Thr Ala Pro Leu Arg Arg Ala Val Asp Tyr Leu Ala Gly Gly Ala Ala 
130 135 140 

Leu Cys Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe Leu 
145 150 155 160 

Val Gly Gin Met Phe Thr Tyr Arg Pro Arg Gin Kis Ala Thr Val Gin 
165 170 175 

Asn Cys Asn Cys Ser He Tyr Ser Gly His Val Thr Gly His Arg Met 
130 135 190 

Ala 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 95 9 base pairs 
(3) TYPE: nucleic acid 

(C) STRAND ID NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(lii) ANT I- SENSE: NO 



(vii) IMMEDIATE SOURCE: 
(3) CLONE: PC-3-- 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3 . . 959 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 9 : 

CC ATG AGC AC3 AAT CCT AAA CCT CAA AGA AAA AC" V - ^ r- 

Mez Ser Thr Asa Pro Lvs Pro Gin Arg L vs C T~ ~^ AC ~ 4 7 

x ^ i,-a Arg U ys Thr Lys Arg Asa Thr 

10 X5 

AAC CGT CGC CCA CAG GAC GTC AAG ^ C ~ ■ , 

Asn Arg Arg Pro Gin Asp Val Lys ^ p ~ Gly GIv G 13 ^ ^ ^ 

20 23 - ^ - 

GGC GGA GTT TAC TTG TTG CCG CGC AGG GGC C~- AC a — n~~ 

«y v,x ^ ^ Leu Pro ^ ^ £ £ a.. ™ « 



40 A - 

4a 



GCG ACT CGG AAG ACT TCG GAA CGG TCG CAA C~- c~~ 

Ala Thr Arg Lvs Thr Ser Giu Arc Ser ^1 ?T ^ CGT ^ G 

50 5 r "* u = Arg Gin 



55 



CCT ATT CCC AAG GCG CGC CAG CCC ACG or- r-~ 

Pro I!- 0 _ 0 r v _ ^" ACj ^ c ~ TCC TGG GGT CAA CCC 

65 ' 70 " G1/ ^= a « -~? Gly cia Pro 
GGC- TAC CCT 



« o , ^ ,„ £ S^SSS £ £ 5 iz £ - 

83 90 95 

f C f C C " TCT CG ° CC - *K TGG GGC CCC AA^ G - 

irp Leu Leu Se~ A , — nl c ^ Uwv - 

.-3 *- a Gly Ser Arg Pro As, Trp Gly Pro As, Asp 



105 110 



CCC CGG CGA AAA. TCG CG~ T~r- ~ 

- - - - ~ - - * - - 2 S £ - - 



120 



TGC GGA TTC GCC GAT CTC ATG GGG TA" A^C rr- r-- , ~ 

c " Giy lit "* - - - ^ £ £ £ t:: or.; £ s 

J.** u 

ATT GGG GGC GTC GCA AGG GCT CTC GCA CAC G~r r~r 

lie Gly Gly Va^ AT a A— Al a r AGG GTC CTT GAG 

145 a SU Ala HiS Gl * Val ^5 Val Leu Glu 

GAC GGG GTA AAC TAT GCA ACA GGG AA~ TT" r~r> r™ -v- 

Aqo rii, v,i » _ ° HAi iXn G-C GvjT TGC TCT TTC tct 

ASP G.y val Asn Tyr Ma Thr Gly As, Leu Pro Gly Cys Ser Phe III 

163 ^ 175 

ATC TTT ATT CTT GCT CTT CTC TCG TG- CTG AC r - r-~ 

lie Ph- ti q La „ fina . , T „ CTG AO - G -- GCC TCT GCA 

ISO ° yS LeU Thr Val ? ™ A " a ^r Ala 

130 1S = 190 

GTT CCC TAC CGA AAT GCC TCT GGG ATT TA- C- C~ i~ a 

v.i p.-o ^ % ^ „ r G!y xi . - ;^ - S; ^ 

200 2Q5 



95 



227 



33: 



527 



623 
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CCA AAC TCT TCC ATA GTC TAT GAG GCA GAT AAC CTG ATC CTA CAC GCA 
Pro Asn Ser Ser lie Val Tyr Glu Ala Asp Asn Leu lie Leu His Ala 
210 215 220 



67 



CCT 
Pro 



' GTC 
Val 
240 



GGT 
Gly 



CAA 
Gin 



TGC 
Cys 



ATT 
He 



GTG 
Val 



ACC 
Thr 



Pro 



CCT 
Pro 



TGT GTC ATG ACA 
Cys Val Met Thr 

230 

ACA CTG TCA GCC 
Thr Leu Ser Ala 



GGT 
Gly 



CCG 
Pro 



AAT GTG AGT 
Asn Val Ser 

235 

AGC CTC GGA 
Ser Leu Gly 

250 



AGA TGC TGG 
Arg Cys Trp 



GCA GTC ACG 
Ala Val Thr 
255 



76 



GCT 
Ala 



CC^ 
Pro 



CTT 
Leu 



CGG 



AGA 
Arg 
260 



GCC GTT GAC TAC 
Ala Val Aso Tvr 



CTA 
Leu 
265 



GCG GGA GGG 
Ala Glv Glv 



GCT GCC CTC 
Ala Ala Leu 
270 



31 



TGC TCC GCG TTA TAC GTA GGA GAC GCG TGT GGG GCA CTA TTC TTG GTA 
Cys Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe Leu Val 
275 23C 285 



GGC CAA ATG TTC ACC TAT AGO CCT CGC CAG 
Gly Gin Met Phe Thr Tyr Arg Pro Arg Gln 
290 295 



CAC GCT ACG 
His Ala Thr 

300 



GTG CAG AAC 
Val Gin Asn 



91 



TGC 

Cys 



AAC 
Asn 
305 



TGT 

Cvs 



Ser 



ATT TAC AGT GGC CAT 
lie Tyr Ser Gly His 
310 



GTT 

Val 



ACC GGC CAC 
Thr Glv His 
315 



CGG ATG GCA 
Arc Met Ala 



95 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHAP-ACTSRISTICS : 
(A) LENGTH: 319 amine acids 
(3} TYPE : amine acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Met Gly Val Arg' Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

lie Pro Lys Ala Arg Gin Pro Thr "Gly Arg Ser Trp Gly Gin Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Ala Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 



SUBSTITUTE SHEET (RULE 26) 




WO 9412560^^ FCT/EP94/01323 

140 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Asn Trp Gly Pro Asn Asp Pro 

100 105 * 110 

Arg Arg Lys Ser Arg Asn Leu Gly Lys Val lie Aszs Thr Leu Thr Cys 
115 120 " 12 5 

Gly Phe Ala Asp Leu Met Gly Tyr lie ?r= Leu Val Glv Gly Pro lie 
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 -50 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
165 170 175 

Phe He Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Val 
180 135 190 

Pro Tyr Arg Asn Ala Ser Gly He Tyr His Val Thr Asn Asp Cys Pro 
135 200 205 

Asn Ser Ser lie Val Tyr Glu Ala Asp Asn Leu He Leu His Ala Pro 
210 215 220 

Gly Cys Val Pro Cys Val Met Thr Gly Asn Val Ser Ar=r Cvs Trp Val 
225 230 235 ~ 240 

Gin He Thr Pro Thr Leu Ser Ala Pro Ser Leu Gly Ala Val Thr Ala 
243 250 255 

Pro Leu Arg Arg Ala Val Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys 
260 255 270 

Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe Leu Val Gly 
275 230 235 

Gin Met Phe Thr Tyr Arg Pro Arg Gin His Ala Thr Val Gin Asn Cys 
290 295 300 

Asn Cys Ser He Tyr Ser Gly His Val Thr Gly K-3 Arg Met Ala 
305 310 315 

(2) INFORMATION FOR SEQ ID NO : 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

(ill) HYPOTHETICAL : NO 

(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCH 
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(B) CLONE: PC-3-8 



( ix ) FEATURE : 

(A) NAME / KEY : CDS 
(3) LOCATION: 3 . . 959 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO : 51: 

CC AT 3 AGC AC 3 AAT CCT AAA CCT CAA AGA AAA ACC AAA AGA AAC ACC 4 7 

Met: Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr 

1.5 10 15 

AAC CGT CGC CCA CAG GAC GTC AAG TTC CCG GGC GGT GOT CAG ATC GTT 95 

Asn Arg Arg Pro Gin Asp Val Lys Phs Pro Gly Gly Gly Gin lie Val 

20 25 30 

GGC GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGG ATG GGT GTG CGC 14 3 

Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Mez Gly Val Arg 
35 40 45 

GCG ACT CGG AAG ACT TCG GAA CGG TCG CAA CCC CGC GGA CGG CGT CAG 191 

Ala Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin 
50 55 60 

CCT ATT CCC AAG GCG CGC CAG CCC ACG GGC CGG TCC TGG GGT CAA CCC 23 9 

Pro lie Pro Lys Ala Arg Gin Pro Thr Gly Arg Ser Trp Gly Gin Pro 
65 70 75 

GGG TAC CCT TGG CCC CTT TAC GCC AAT GAG GGC CTC GGG TGG GCA GGG 23 7 

Gly Tyr Pro Trp Pro Leu Tyr Ala Asn Glu Gly Leu Gly Trp Ala Gly 
80 35 90 95 

TGG CTG CTC TCC CCT CGA GGC TCT CGG CCT AAT TGG GGC CCC AAT GAC 33 5 

Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro Asn Trp Gly Pro Asn Asp 

100 105 no 

CCC CGG CGA AAA TCG CGT AAT TTG GGT AAG GTC ATC GAT ACC CTA ACG 3 33 

Pro Arg Arg Lys Ser Arg Asn Leu Gly Lys Val lie As? Thr Leu Thr 
115 120 125 

TGC GGA TTC GCC GAT CTC ATG GGG TAC ATC CCG CTC GTA GGC GGC CCC 431 

Cys Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Leu Val Gly Gly Pro 
130 135 140 

GTT GGG GGC GTC GCA AGG GCT CTC GCA CAC GGT GTG AGG GTC CTT GAG 47 3 

Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu 
145 150 155 

GAC GGG GTA AAC TAT CCA ACA GGG AAT TTA CCC GGT TGC TCT TTC TCT 52 7 

Asp Gly Val Asn Tyr Pro Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser 
160 165 170 175 

ATC TTT ATT CTT GCT CTT CTC TCG "TGC CTG ACC GTT CCG GCC TCT GCA 5 75 

lie Phe lie Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala 

130 195 190 

GTT CCC TAC CGA AAT GCC TCT GGG ATT TAT CAT GTT ACC AAT GAT TGC 6 23 
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Val Pro Tyr Arg Asn Ala 
195 



Ser 



142' 

Giy lie Tyr 
200 
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His Val 



CCA AAC TCT TCC ATA GTC TAT GAG GCA GAT AAC CTG 
Pro Asn Ser Ser lie Val Tyr Glu Ala Asp Asn Leu 
210 215 

CCT GGT TGC GTG CCT TGT GTC ATG ACA GGT AAT GTG 
Pro Gly Cys Val Pro Cys Val Mec Thr Gly Asn Val 
225 230 7^ 



Thr Asn Asp Cys 
205 

ATC CTA CAC GCA 

lie Leu His Ala 
220 

AGT AGA TGC TGG 

Ser Arg Cys Trp 



671 



GTC CAA ATT ACC CCT ACA 
Val Gin lie Thr Pro Thr 
240 245 

GCT CCT CTT CGG AGA GCC 
Ala Pro Leu Arg Arg Ala 

26 0 



Leu 



GTT 
Val 



TCA GCC CCG 
Ser Ala Pro 



GAC TAC CTA 
Asp Tyr Leu 
265 



AGC CTC 
Ser Leu 
250 

GCG GGA 
Ala Glv 



TGC TCC GCG TTA TAC GTA GGA GAC GCG TGT GGG GCA 
Cys Ser Ala Leu Tyr Val Giy Asp Ala Cys Giy Ala 
275 280 



GGC CAA ATG TTC ACC TAT 
Gly Gin Mec Phe Thr Tyr 
290 



AGC 



CCT CGC CAG 
Pro Arg Gin 
295 



TGC AAC TGT TCC ATT TAC AGT GGC CAT GTT 
Cys Asn Cys Ser lie Tyr Ser Giy His Val 
305 310 



CAC GCT 
His Ala 



ACC GGC 
Thr Giy 
315 



GGA GCA 
Glv Ala 



GGG GCT 
Giy Ala 



CTA TTC 
Leu Phe 
285 

ACG GTG 
Thr Val 
300 



GTC ACG 
Val Thr 
253 

GCC CTC 
Ala Leu 
270 

TTG GTA 
Leu Val 



CAG AAC 
Gin Asn 



CAC CC 



ATG GCA 
Me: Ala 



16' 



915 



863 



353 



(2) INFORMATION FOR SEQ ID NO: 52: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 319 amine acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
1 5 10 * is 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Giy Gin lie Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Giy Pro Arg Met Giy Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Glv Arg Arg Gin Pre 
50 55 60 

lie Pro Lys Ala Arg Gin Pro Thr Gly Arg Ser Trp Giy Gin Pro Giy 
65 70 75 ao 
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Tyr Pro Trp Pro Leu Tyr Ala Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Asn Trp Gly Pro Asn Asp Pro 
100 105 110 

Arg Arg Lys Ser Arg Asn Leu Gly Lys Val lie Asp Thr Leu Thr Cys 
115 120 125 

Gly ?he Ala Asp Leu Met Gly Tyr lie Pro Leu Val Gly Gly Pro Val 
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Tyr Pro Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie 
165 170 175 

Phe lie Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Val 
130 135 190 

Pro Tyr Arg Asn Ala Ser Gly lie Tyr His Val Thr Asn Asp Cys Pre 
195 200 205 

Asn Ser Ser lie Val Tyr Glu Ala Asp Asn Leu lie Leu Kis Ala Pro 
210 215 22C 

Gly Cys Val Pro Cys Val Met Thr Gly Asn Val Ser Arg Cys Trp Val 
225 230 235 240 

Gin lie Thr Pro Thr Leu Ser Ala Pro Ser Leu Gly Ala Val Thr Ala 
245 250 255 



Pro Leu Arg Arg Ala Val Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys 
260 265 270 



Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe Leu Val Gly 
275 280 235 

Gin Met: Phe Thr Tyr Arg Pro Arg Gin His Ala Thr Val Gin Asn Cys 

290 295 300 



Asn Cys Ser lie Tyr Ser Gly His Val Thr Gly His Arg Met Ala 
305 310 315 



(2) INFORMATION FOR SEQ ID NO : 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 9 base pairs 
(3) TYPE: nucleic acid 

(C) STRAND EDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
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(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE : 

(B) CLONE: PC C/El 

(ix) FEATURE: 

{ A } NAME / I-TEY : CD S 
(3) LOCATION : 2 . .95 5 

{xi} SEQUENCE DESCRIPTION: 5EQ 10 NO: 53. : 
CCA7GAGCAC GAATCCTAAA CCTCAAAGAA AAACCAAAAG AAACACCAAC CG7CGCCCAC SO 
AGGACGTCAA GTTCCCGGGC GGTGGTCAGA 7CG77GGCSG AGTTTACTTG TTGCCGCGCA 120 
GGGGCCCTAG GATGGGTGTG CGC3CGAC7C GG AAGACTT C GGAACGGTCG CAACCCCGTG 
GACGGCGTCA GCCTATTCCC AAGGCGCGCC AGCCCXCGGG CCGGTCCTGG GGTCAACC CG 
GGTACCCTTG GCCCCTTTAC GCCAA7GAGG GCC7CSGG7G GGCAGGGTGG CTGCTCTCCC 
CTCGAGGCTC TCGGCCTAAT TGGGGCICCA ATGACCCCCG GCGAAAATCG CG7AAT77GG 
GTAAGGTCAT CGATACCCTA ACGTGCGGAT TCGCCGATCT CATGGGGTAY ATCCCGCTCG 
TAGGCGGCCC CRTTGGGGGC G7C3CAAGGG C7C7CGCACA CGG7G7GAGG G7CC77GAGG 
ACGGGG7AAA C7ATSCAACA GGGAATT7AC CCGGTTGCTC 



I3C 
240 
300 
350 
420 
430 



:TAT7C77G 540 



C7C77C7C7C GTGTCTGACC GTTCOGGCCT C7GCAG77CC C7ACCGAAA7 GCCTCTGGGA 600 
T77A7CATG7 TAC CAATGA7 TGCCCAAACT CTTCCA7AG7 CTA7GAGGCA GA7AACC7GA 6 50 



720 



900 



o - a 



TCCTACACGC ACCTGGTTGC GTGCCT7G73 TCATGACAGG TAA7G7GAG7 AGA7GCTGGG 

TCCAAA77AC CCCTACAC7G TCAGCCCCGA GCCTCC-GAGC AG7CACGGCT CC7CTTCGGA 7 30 

GAGCCGTTGA C7ACCTAGCG GGAGGGGC7G CCC7C7GC7C CGCG77A7AC GTAGGAGACG 8 40 

CGTGTGGGGC ACTATTCT7G GTAGGCCAAA TGTTCACCTA TAGGCCTCGC CAGCACGC7A 

CGG7GCAGAA CTGCAACTGT TCCATT7ACA G7GGCCA7G7 TACCGGCCAC CGGA7GGCA 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 319 arr.ino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: pr;:e:r. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Mec Ser Thr Asn Pro Lys Pro GI- Arg Lvs Thr Lvs Arg Asn Thr Asr. 
1 5 io is 
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Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 

20 25 30 

Gly Val Tyr Leu Leu Pre Arg Arg Gly Pro Arg Met Gly Val Arg Ala 

35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

50 55 60 

lis Pro Lys Ala Arg Gin Pro Thr Gly Arg Ser Trp Gly Gin Pro Gly 

65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Ala Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Asn Trp Gly Pro Asn Asp Pro 

100 105 110 

Arg Arg Lys Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 

115 120 * * 125 

Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Leu Val Gly Gly Pro Val 

130 135 140 



Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 



145 



150 



ISO 



Gly Val Asn Tyr Pro Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie 

165 170 175 

Phe He Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Val 

130 135 190 

Pro Tyr Arg Asn Ala Ser Gly He Tyr His Val Thr Asn Asp Cys Pro 

195 200 205 

Asn Ser Ser He Val Tyr Glu Ala Asp Asn Leu He Leu His Ala Pro 

210 215 220 

Gly Cys Val Pro Cys Val Met Thr Gly Asn Val Ser Arg Cys Trp Val 

225 230 235 ~ 240 

Gin He Thr Pro Thr Leu Ser Ala Pro Ser Leu Gly Ala Val Thr Ala 

245 250 255 

Pro Leu Arg Arg Ala Val As? Tyr Leu Ala Gly Gly Ala Ala Leu Cys 

260 255 270 

Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe Leu Val Gly 

275 230 235 

Gin Met Phe Thr Tyr Arg Pro Arg Gin His Ala Thr Val Gin Asn Cys 

290 295 300 



Asn Cys Ser lie Tyr Ser Gly His Val Thr Gly His Arg Met Ala 
305 310 315 
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(2) INFORMATION FOR SEQ ID NO : 55: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TY?E : cDNA 
(iii} HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE : 

(B) CLONE: PC-i-37 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .3 54 



(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ACCACCGGAG CTTCTATCAC ATACTCCACT TACGGCAAGT TCCTTGCTGA TGGAGGGTGT 6 0 

TCAGGCGGCG CGCATGACGT GATCATATGC GACGAGTGCC ATTCCCAGGA CGCCACCACC 12 C 

ATTCTTGGGA TAGGCACTGT CCTTGACCAG GCAGAGACGG CTGGAGCTAG GCTCGTCGTC 13 0 

TTGGCCACGG NCACCCCTCC CGGCAGTGTG ACAACGCCCC ACCCCAACAT CGAGGAAGTG 24 0 

GCCC7GCCTC AGGAGGGGGA GGTTCCCTTC TACGGCAGAG CCAT7CCCCT TGCTTTTATA 3 00 

AAGGGTGGTA GG CAT CT CAT CTTCTGCCAT TCCAAGAAAA ATTGTGATGA ACTC 3 54 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 
(3) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Thr Thr Gly Ala Ser lie Thr" Tyr Ser Thr Tyr Gly Lys Phe Leu Ala 
15 10 15 

Asp Gly Gly Cys Ser Gly Gly Ala Kis Asp Val lie lie Cys Asp Giu 
20 25 30 



SUBSTITUTE SHEET (RULE 26) 




W0 94/25^^F PCT/EP94/0O23 

147 



Cys His Ser Gin Asp Ala Thr Thr He Leu Gly He Gly Thr Val Leu 
35 40 45 

Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val Val Leu Ala Thr Xaa 
50 55 60 

Thr Fro Pro Gly Ser Val Thr Thr Pro His Pro Asn He Glu Glu Val 
65 7G 75 80 

Ala Leu Pro Gin Glu Gly Glu Val Pro Phe Tyr Gly Arg Ala He Pro 
35 90 95 

Leu Ala Phe He Lys Gly Gly Arg His Leu He Phe Cys His Ser Lys 
ICQ 105 110 

Lys Asn Cys Asp Glu Leu 
115 

(2) INFORMATION ?C?. SEQ ID NO : 57: 
(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 3 5 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNZSS : single 

(D) TOPOLOGY: Lir.€ 

MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 



(vii) IMMEDIATE SOURCE : 

(3) CLONE: PC- 1-43 

(ix) FEATURE: 

(A) NAME /KEY: CDS 
(3) LOCATION: I. .3 54 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

ACCACCGGAG CTTCTATCAC ATACTCCACT TACGGCAAGT TCCTTGCTGA TGGAGGGTGT 60 

TCAGGCGGCG CGTATGACGT GATCATATGC GACGAGTGCC ATTCCCAGGA CGCCACCACC 12 0 

ATTCTTGGGA TAGGCACTGT CCTTGACCAG GCAGAGACGG CTGGAGCTAG GCTCGTCGTC ISO 

TTGGNCACGG NCACCCCTCC CGGCAGTGTG ACAACGCCCC ACCCCAACAT CGAGGAAGTG 2 40 

GCCCTGCCTC AGGAGGGGGA GGTTCCCTTC TACGGNAGAG CCATTCCCCT TGCTTTTATA 3 00 

AAGGGTGGTA GGCATCTCAT CTTCTGCCAT TCCAAGAAAA AATGTGATGA ACTT 3 54 
(2) INFORMATION FOR SEQ ID NO: 53: 
(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 13 3 amino acids 

(B) TYPE : amino acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5S: 

Thr Thr Gly Ala Ser He Thr Tyr Ser Thr Tyr Giy Lys Phe Leu Ala 
15 10 15 

Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp Val He He Cys Asp Glu 
20 25 30 

Cys His Ser Gin Asp Ala Thr Thr lie Leu Giy He Gly Thr Val Leu 
35 40 45 

As? Gin Ala Glu Thr Ala Gly Ala Arg Leu Val Val Leu Xaa Thr Xaa 
50 55 60 

Thr Pro Pro Gly Ser Val Thr Thr Pro H,s Pro Asn He Glu Glu Val 
65 70 75 SO 

Ala Leu Pro Gin Glu Gly Gl- Val Pro Phe Tyr Xaa Arg Ala He Pro 
85 SC 95 

Leu Ala Phe lie Lys Giy Giy Arg His Leu lie Phe Cys His Ser Lys 

100 105 110 

Lys Lys Cys Asp Glu Leu Arg Gin Ala Thr Asp Gin Pro Giy Arg Glu 
115 120 125 

Arg Pro Trp Glu Tyr 
130 



(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 57 base pairs 

(B) TYPE : nucleic acid 

(C) ST RAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cCNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: PC- I- 37 

(ix) FEATURE : 

(A) NAME /KEY : CDS 
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(B) LOCATION: 1..3 57 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

ATGGCTTTCA TGTCTCCGGA C7TGGAGGTC ATTACCANCA CTTGGGTTCT GGTGGGGGGC 6 0 

GTTGTGGCGA CCCTGNCGNC CTACTGITTG ACGGTGGGTT CGGTAGCCAT AGTCGGTAGG II 0 

ATCATCCTCT CTGGGAAACC TGCCATCATT NCCGATAGGG AGGTATTATA CCAGCAATTT 

GATGAGATGG AGGAGTGCTC GC-CCTCGTTG CCCTATATGG ACGAAACACG 7NCCATTGCC 

GGACAATTCA AAGAGAAAGT GCTCGGCTTC ATCAGCACGA CCGGCCAGAA GGCTGAAACT 

CTGAAGCCGG CAGCCACGTC TGTGTGGAAC AAGGCTGATC AGTTCTGGNC CACATAC 3 57 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 123 amino acids 
(3) TYPE: arr.ir.o acid 

(C) STRAND ED NESS : single 

(D) TCPCLCGV: linear 

(ii) MOLECULE TYPE : prccein 



(xi) SEQUENCE DESCRIPTION: SEQ ID HO : 60: 

Met: Ala ?he Me: Ser Pro As? Leu Glu Val He Thr Xaa Thr Trp Val 
3 10 15 

Leu Val Glv Glv Val Val Ala Thr Leu Xaa Xaa Tyr Cys Leu Thr Val 
20* 15 30 

Glv Ser Val Ala He Val Gly Arg He He Leu Ser Giy Lys Pro Ala 
35 40 45 

lie lie Xaa Asc Arg Glu Val Leu Tyr Gin Gin Phe Asp Glu Mec Glu 

50 55 60 

G^u Cys S — Ala Ser Leu Pro Tyr Mec Asp Glu Thr Arg Xaa He Ala 
65 " 70 75 80 

Gly Glr Ph~ Lvs Glu Lvs Val Leu Gly Phe He Ser Thr Thr Gly Gin 
* 55 90 95 

Lys Ala Glu Thr Leu Lvs Pro Ala Ala Thr Ser Val Trp Asn Lys Ala 
100 ' 105 H° 

Asd Gin Phe Tro Xaa Thr Tyr Men Trp Asn Phe lie Ser Gly He Gin 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 61: 
(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 3 57 base pairs 

(3) TYPE : nucleic acid 

(C) STRAND ED NESS : single 

(D) TOFCLCGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) ANTI-5SNSZ: NO 



(viij IMMEDIATE SOURCE: 

(3) CLONE: PC-i-43 

(IX) FEATURE: 

(A) N AiME / KEY : CDS 
(3) LOCATION: 1 . .357 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 : 

ATGGCTTGCA TGTCTGCGGA CCTGGAGGTC ATTACCANCA CTTGGGTTCT GGTGGGGGGC 

G7TGTGGCGN CZCZGGCZGZ CTACTGCTTG ACGGTGGGTT CGGTAGCCAT AGTCGGTAGG 

ATCATCCTCT CTGGGAAACC 7GCCACCATT CCCGATAGGG AGGCATTATA CCANCAATTT 



240 
300 
357 



130 

GATGAGATGG AGGAGTGCTC GGCCTCGTTG CCCTATATGG ACGAGACACG TGCCATTGCC 

GGACAATTCA AAG AG AAAG T GCTCGGCTTC ATCAGCACGA CCGGCCAGAA GGCTGAAACT 

CTGAAGCCGG CAGCCACGTC TGTGTGGAAC AAGGCTGANC AGTTCTGGGC CACATAC 

(2) INFORMATION FOR SZZ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 12 3 amino acids 
(3) TYPE: amino acid 
(0) STRANDEDNZS3 : single 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Met Ala Cys Met Ser Ala Asp Leu Glu Val lie Thr Xaa Thr Trp Val 
15 10 15 

Leu Val Gly Gly Val Val Ala Xaa Leu Ala Ala Tvr Cvs Leu Thr Val 
20 25 30 

Gly Ser Val Ala lie Val Gly- Arg lie lie Leu Ser Gly Lys Pro Ala 
3 5 40 45" 

He He Pro Asp Arg Glu Ala Leu Tyr Xaa Gin Phe Aso Glu Met Glu 
50 55 S0 
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Glu Cys Ser Ala Ser Leu Pro Tyr Met Asp Glu Thr Arg Ala lie Ala 
65 70 75 80 

Gly Gin Phe Lys Glu Lys Val Leu Gly Phe lie Ser Thr Thr Gly Gin 
83 90 95 

Lys Ala Glu Thr Leu Lys Pre Ala Ala Thr Ser Val Trp Asr* Lys Ala 
100 105 110 

Xaa Gin Phe Trp Ala Thr Tyr Met: Trp Asn Phe lie Ser Gly lie Gin 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (generic) 
(iii) HYPOTHETICAL: YES 
(iii) ANTI-SENSE: NC 



( i:< ) FEATu r^E : 

(A) NAME / KEY : misc_f eature 
(3) LOCATION: 1 . .23 

(D) OTHER INFORMATION : /star.dard_aame = " HCV Primer 
KCPrlSl" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

ACCGGAGGCC AGGAGAGTGA TCTCCTCC 2 8 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE T^PE : DNA (genomic) 

(iii) HYPOTHETICAL : YES 

(iii) ANTI-SENSE: YES 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 
(3) LOCATION: 1 . .28 

(D) OTHER INFORMATION: / scandard_name= "HC/ Primer 
HCPrlS2" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 : 
GGGCTGCTCT ATCCTCATCG ACGCCATC 

28 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA {genomic) 

(iii) HYPOTHETICAL: YES 

(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

r1 (A) NAME / KEY : misc_feature 

(B) LOCATION: 1. .23 

S (D) 0TKE * INFORMATION : /standard name= "HCV P^ime- 

L~ HC?riS3" " * 

<P < xi > SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Q GCCAGAGGCT CGGAAGGCGA TCAGCGCT 

(2> INFORMATION FOR SEQ ID NO: 66- 

^ <i> SEQUENCE CHARACTERISTICS: 

r " (A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(iii) HYPOTHETICAL : YES 
(iii) ANTI-SENSE: YES 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION: 1. .28 

(D) OTHER INFORMATION: /standard_name= "HCV Prime- 
HCPrl64" 



28 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66; 
GAGCTGCTCT GTCCTCCTCG ACGCCGCA 

2. 8 

(2) INFORMATION FOR SEQ ID NO: 67: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STKANDEDNE5S: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iii) ANTI- SENSE: NO 

(ix) FEATURE: 

(A) NAME / KEY : misc_ieature 
(BJ LOCATION: 1..28 

(D) OTHER INFORMATION: /standard_name= "HCV Primer 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 27 base pairs 
(3) TYPE : nucleic acid 

(C) STRANDEDNE3S : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(iii) HYPOTHETICAL : YES 
(iii) ANTI-SENSE: YES 

(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 1..28 

(D) OTHER INFORMATION: /standard name= "HCV Pri 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
CTATTACCAG TT CAT CATC A TATCCCA 
(2) INFORMATION FOR SEQ ID NO : 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE : nucleic acicr 

(C) STRANDEDNESS : single 
■(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 



HC?r23 " 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 67: 



CTCATGGGGT ACATTC 



20 
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(iii) HYPOTHETICAL: YES 

(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 
(3) LOCATION: 1 . . 28 

(D) OTHER INFORMATION: /standard_name= "HCV Primer 
HCPrllS- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 69: 
TTTTAAATAC ATCATGRCTG YATG 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(iii) HYPOTHETICAL: YES 
(iii) ANTI-SENSE: YES 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 1..28 

(D) OTHER INFORMATION: /standard_name= "HCV Primer 
HCPrSS" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
CTATTATTGT ATCCCRCTGA TGAARTTCCA CAT 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iii) ANTI-SENSE: YES 

(ix) FEATURE: 

(A) NAME /KEY : misc feature 
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(B) LOCATION: 1..28 

(D) OTHER INFORMATION : / s t andar d_name = "HCV Primer 
HC?rll3 : 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

ACTAGTCGAC TAYTGATCCR CTATRWARTT CCACAT 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 5 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL : YES 
(iii) ANTI- SENSE: NC 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1..28 

(D) OTHER INFORMATION: /s bandar d_name= "HCV Primer 
HC?rll7 : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
TTTTAAATAC ATCGCRCTGC ATGCA 
(2) INFORMATION FOR SEQ ID NO : 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: YES 

(iii) ANTI-SENSE: YES 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eanure 
(3) LOCATION: 1. .23 

(D) OTHER INFORMATION: / st andar d_name= "HCV Primer 
HCPrI19 : 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 73: 
ACTAGTCGAC TARTTGCATA GCCKRTTCAT CCAYTG 
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{2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 
( iii) ANTI- SENSE: NO 



(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION: 1..23 

(D) OTHER INFORMATION: /s tandard_name=: "HCV Pri 
HC?rl31 : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
GGAATTCTAG ACCTCTGGGA YGARAYTGGA ARTG 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 
(iii) ANTI-SENSE : NO 



(ix) FEATURE: 

(A) NAME / KEY : mi sc_f eature 

(B) LOCATION: 1..28 

(D) OTHER INFORMATION: /s tandard_name= "HCV Primer 
HC?rl30 : 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 75: 
GGAATTCTAG ACGCTAYCAR GCACGTTGYG C 31 



(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 
(ill) ANTI- SENSE: NO 



(ix) FEATURE: 

(A) NAME / KEY : misc_ feature 

(B) LOCATION: 1. .28 

(D) OTHER INFORMATION: /standard_name= "HCV Primer 
KC?rl34: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
CATATAGATG CCCACTTCCT ATC 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IS base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES5 : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(iii) HYPOTHETICAL: YES 
(iii) ANTI-SENSE: YES 



(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION: 1. .23 

(D) OTHER INFORMATION: /standard_name= "HCV Primer 
KC?r3 : 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
GTGTGCCAGG AC CATC 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
"(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: YES 



(iii) ANTI-SENSE: YES 



(ix) FEATURE: 



(A) NAME/KEY: misc_feature 
(3) LOCATION: I. .28 

(D) OTHER INFORMATION: /scandard_name= -EC/ Prii 



HC?r4 : 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
GACATGCATG TCATGATGTA 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 
(3) LOCATION: 1..28 

(D) OTHER INFORMATION: /standard_name= "HCV Primer 
HC?rl52 : 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

TACGCCTCTT CTATATCGGT TGGGGCCTG 

(2) INFORMATION FOR SEQ ID NO : 80: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 26 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(iii) HYPOTHETICAL: YES 
(iii) ANTI-SENSE: NO 
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(ix) FEATURE: 

(A) NAME/KEY: mis cofeature 

(B) LOCATION; 1 . . 28 

(D) OTHER INFORMATION: /scandard_name= "HCV Primer 
HC?r52 : 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
ATGTTGGGTA AGGTCATCGA TACCCT 
(2) INFORMATION FOR SEQ ID NO: 81: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(iii) HYPOTHETICAL: YES 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION: 1 . . 29 

(D) OTHER INFORMATION: /standard_name= "HCV Primer 
HC?r41 : 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 81: 
CCCGGGAGGT CTCGTAGACC GTGCA 2 
(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: YES 
(iii) ANTI-SENSE: YES 



(ix) FEATURE: 

(A) NAME / KEY : misc__f eature 
(3) LOCATION: 1 . . 28 

(D) OTHER INFORMATION: /standard_name= " HCV Primer 
HC?r40 : 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
CTATTAAAGA TAGAGAAAGA GCAACCGGG 



(Z) INFORMATION FCR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(viii) POSITION IN PROTEIN: 

(B) MAP POSITION: positions 192 to 203 of the VI region of HCV 

type 3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
Leu Glu Trp Arg Asn Thr Ser Gly Leu Tyr Val Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 
(iii) HYPOTHETICAL : NO 



(viii) POSITION IN PROTEIN : 

(B) MAP POSITION: positions 192 to 203 of the VI region of HCV 

type 5 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 84: 

Val Pro Tyr Arg Asn Ala Ser Gly lie Tyr His Val 
15 10 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(viii) POSITION IN PROTEIN: 

(B) MAP POSITION: positions 213 to 223 of the V2 region ofHCV 

type 3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Val Tyr Glu Ala As? Asp Val He Leu His Thr 
1 5 " 10 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amine acid 

(C) STRANDEDNZSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(iii) HYPOTHETICAL: NO 



(viii) POSITION IN PROTEIN : 

(B) MAP POSITION: positions 213 to 233 of the V2 region of HCV 

type 5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Val Tyr Glu Ala Asp Asn Leu He Leu His Ala 
1 5 10 



(2) INFORMATION FOR SEQ ID NO : 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 13 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNE3S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(iii) HYPOTHETICAL: NO 



(viii) POSITION IN PROTEIN: 

(B) MAP POSITION: positions 230 to 242 of the V3 region of HCV 

type 3 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Val Gin Asp Gly Asn Thr Ser Thr Cys Trp Thr Pro Val 
1 5 io 

(2) INFORMATION FOR SEQ ID NO : 88: 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(viii) POSITION IN PROTEIN: 

(B) MAP POSITION: positions 230 to 242 of the V3 region of HO/ 

type 5 



N! (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

fU Val Met Thr Gly Asn Val Ser Arg Cys Trp Val Gin lie 

ru 15 io * 

J~ (2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL : NO 



if! 



(viii) POSITION IN PROTEIN: 

(B) MA? POSITION: positions 248 to 257 of the V4 region of HCV 

type 3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 89: 

Val Arg Tyr Val Gly Ala Thr Thr Ala Ser 
15 io 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: pepcide 
(iii) HYPOTHETICAL: NO 

(viii) POSITION IN PROTEIN: 

(B) MA? POSITION: positions 248 to 257 of the V4 region of HCY 

type 5 

(xi) SEQUENCE DESCRIPTION: SEQ ID SO: 90: 

Ala Pro Ser Leu Gly Ala Val Thr Ala Pro 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNZSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(viii) POSITION IN PROTEIN: 

(B) MAP POSITION: positions 294 to 303 of the V5 region or KCV 

type 3 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 91: 

Arg Pro Arg Arg His Gin Thr Val Gin Thr 
I 5 10 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL : NO 

(viii) POSITION IN PROTEIN: . 

(3) MAP POSITION: positions 294 to 303 of the V5 region of ncv 

type 5 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 92: 
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Arg Pro Arg Gla His Ala Thr Val Gin Asn 
1 5 1Q 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(viii) POSITION IN PROTEIN: 

(B) MAP POSITION: positions 70 to 78 of HCV type 5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93- 
Gln Pro Thr Gly Arg Ser Trp Gly Gin 



1 



5 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE ; BR33 and BR36 

(viii) POSITION IN PROTEIN: 
type 3 ^ P0SITI0N: PO»icions 230 to 237 of the V3 region of HCV 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 

Val Gin Asp Gly Asn Thr Se- Th- 
1 5 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: HD10 

(viii) POSITION IN PROTEIN: 

(B) MAP POSITION: positions 230 to 237 of the V3 region of HCV 

type 3 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

Val Gin Asp Gly Asn Thr Ser Ala 
1 " 5 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



£ (ii) MOLECULE TYPE : peptide 

H (iii) HYPOTHETICAL: NO 



01 



(vi) ORIGINAL SOURCE: 
t% (C) INDIVIDUAL ISOLATE: BR3S 

(viii) POSITION IN PROTEIN: _ 
H (3) ^ POSITION : positions 248 to 257 of the V4 region or hCV 



type 3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

Val Lys Tyr Val Gly Ala Thr Thr Ala Ser 
1 S 10 



(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL : NO 

(vi) "ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: BR36 
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(viii) POSITION IN GENOME: 

(B) MAP POSITION: Positions 1688 to 1707 of HCV type 3 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

Leu Gly Gly Lys Pro Ala lie Val Pro Asp Lys Glu Val Leu Tyr Gln 

Gin Tyr Asp Glu 

20 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: HD10 

(viii) POSITION IN GENOME : 

(B) MAP POSITION: positions 1688 to 1707 of HCV type 3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

Leu Gly Gly Lys Pro Ala Leu Val Pro Asp Lys Glu Val Leu Tyr Gin 
15 10 15 

Gin Tyr Asp Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(viii) POSITION IN GENOME: 

(B) MA? POSITION: positions 1712 to 1731 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
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Ser Gin Ala Ala Pro Tyr He Glu Gin Ala Gin Val He Ala His Gin 
15 10 15 

Phe Lys Glu Lys 
20 

(2) INFORMATION 7C?. SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 
(3) TYPE : amino acid 
(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: BR36 

(viii) POSITION IN GENOME: 

(3) MAP POSITION: positions 1724 to 1743 of HCV type 3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

He Ala His Gin Phe Lys Glu Lys Val Leu Gly Leu Leu Gin Arg Ala 
15 10 15 

Thr Gin Gin Gin 
20 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: HD10 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: positions 1724 to 1743 of HCV type 3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

He Ala His Gin Phe Lys Glu Lys He Leu Gly Leu Leu Gin Arg Ala 
15 10 15 
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Thx Gin Gin Gin 
20 



(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNES3 : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(viii) POSITION IN GENOME: 

(B) MAP POSITION : positions 1SBQ to 1707 of HCV type 5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

Leu Ser Gly Lys Pro Ala He He Pro Asd Arg Glu Ala Leu Tyr G 1 : 
1 5 10 15 

Gin Phe Asp Glu 

20 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNES3 : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 
(iii) HYPOTHETICAL : NO 



(viii) POSITION IN GENOME: 

(B) MAP POSITION: positions 1638 to 1707 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

Leu Ser Gly Lys Pro Ala He He Pro Asp Arg Glu Val Leu Tyr Gin 
15 10 15 

Gin Phe Asp Glu 

20 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 
(iii) HYPOTHETICAL: NO 



(viii) POSITION IN GENOME: 

(B) MAP POSITION: position 1712 to 1731 of HCV type 5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

Ser Ala Ser Leu Pro Tyr Met Asp Glu Thr Arg Ala lie Ala Gly Gin 
1 5 10 15 

Phe Lys Glu Lys 



(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(viii) POSITION IN GENOME: 

(3) MA? POSITION: positions 1724 to 1743 of HCV type 5 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
lie Ala Gly Gin Phe Lys Glu Lys Val Leu Gly Phe lie Ser Thr Thr 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



20 



1 



5 



10 



15 



Gly Gin Lys Ala 
20 
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(iii) ANTI-SENSE: NO 




(vii) IMMEDIATE SOURCE: 

(B) CLONE: G343-3-10 

( IX) FEATURE: 

(A) NAME /KEY: CDS 
(3) LOCATION : 2. .340 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: IDS: 

C TCC ACT GTA ACC GAA AAG GAC ATC AGG GTC GAG GAG GAG GTC TAT 4 6 

Ser Thr Val Thr Glu Lys Asp He Arg Val GIu Glu Glu Val Tyr 
15 10 15 

CAG TGT TGT GAC CTG GAG CCC GAA GCC CGC AAG GCA ATT ACC GCC CTA 94 
Gin Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Ala He Thr Ala Leu 
20 25 30 

ACA GAG AGA CTC TAC GTG GGC GGT CCC ATG CAT AAC AGC AAG GGA GAC 14 2 

Thr Glu Arg Leu Tyr Val Gly Gly Pro Met His Asr. Ser Lys Gly Asp 
35 40 45 

CTG TGC GGG TAT CGC AGA TGT CGC GCA AGC GGC GTC TAC ACC ACC AGC 190 
Leu Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser 
50 55 SO 

TTC GGG AAC ACA CTG ACG TGC TAC CTC AAA GCC TCA GCC GCT ATC AAA 23 8 

?he Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala He Lys 
65 70 75 

GCG GCG GGG CTG AGA GAC TGC ACC ATG TTG GTC TGT GGT GAT GAC CTG 23o 
Ala Ala Gly Leu Arg Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu 
80 85 90 95 

GTT GTC ATC GCT GAG AGC GAT GGC GTA GAG GAG GAC AAA CGA CCC CTC 3 34 

Val Val He Ala Glu Ser Asp Gly Val Glu Glu Asp Lys Arg Pro Leu 
100 105 110 

GGA GCC 340 
Gly Ala 



(2) INFORMATION FOR SEQ ID NO : 107: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

Ser Thr Val Thr Glu Lys Asp He Arg Val Glu Glu Glu Val Tyr Gin 
15 10 15 
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Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Ala lie Thr Ala Leu Thr 
20 25 30 

Glu Arg Leu Tyr Val Gly Gly Pro Met His Asn Ser Lys Gly Asp Leu 
35 40 45 

Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 
50 55 SO 

Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala He Lys Ala 
65 70 75 80 

Ala Gly Leu Arg Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val 
85 90 95 

Val He Ala Glu Ser Asp Gly Val Glu Glu Asp Lys Arg Pro Leu Gly 
100 105 no 

Ala 

{2) INFORMATION FOR SEQ ID NO : 10 8: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 34 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE5S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

<B) CLONE : G311S-3-5 

(ix) FEATURE: 

(A) NAME /KEY: CDS 
£B} LOCATION: 2.. 340 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

C TCC ACT GTA ACC GAA AAG GAC ATC AGG GTC GAG GAG GAG GTA TAT 4 6 

Ser Thr Val Thr Glu Lys Asp He Arg Val Glu Glu Glu Val Tyr 
15 10 15 

CAG TGT TGT GAC CTG GAG CCC GAG GCC CGC AG A GCA ATT ACC GCC CTA 94 
Gin Cys Cys Asp Leu Glu Pro Glu Ala Arg Arg Ala He Thr Ala Leu 
20 25 30 

ACA GAG AGA CTC TAC GTG GGC GGT CCC ATG CAT AAC AGC AGG GGA GAC 14 2 

Thr Glu Arg Leu Tyr Val Gly Gly Pro Met His Asn Ser Arg Gly Asp 
35 40 45 



SUBSTITUTE SHEET (RULE 26) 



WO 94/2560 



172 




PCT/EP94/0D23 



CTG TGC GGG TAT CGC AGA TGC CGT GCG AGC GGC GTC TAC ACC ACC AGC 
Leu Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser 
50 55 so 



190 



TTC GGG AAC ACA CTG ACG TGC TAT CTC AAA GCC TCA GCC GCT ATC AGA 
Phe Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala He Arg 
65 70 75 



233 



GCG GCG GGG CTG AGA GAC TGC ACC ATG TTG GTC TGT GGT GAT GAC CTG 
Ala Ala Gly Leu Arg Asp Cys Thr Mez Leu Val Cys Gly Asp Asp Leu 
80 35 90 95 



286 



GTC GTC ATT GCT GAA AGC GAT GGC GTA GAG GAG GAC AAA CGA GCC CTC 
Val Val He Ala Glu Ser Asp Gly Val Glu Glu As? Lys Arg Ala Leu 
100 105 no 



334 



GGA GCC 
Gly Ala 



340 



(2) INFORMATION FOR SEQ ID NO: 10 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYFE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NC : 109: 

Ser Thr Val Thr Glu Lys Asp lie Arg Val Glu Glu Glu Val Tyr Gin 
^5 10 15 

Cys Cys Asp Leu Glu Pro Glu Ala Arg Arc Ala lie Thr Ala Leu Thr 
20 25 30 

Glu Arg Leu Tyr Val Gly Gly Pro Met His Asn Ser Arg Gly Asp Leu 
35 40 45 

Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 
50 55 60 

Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala lie Arg Ala 
65 70 75 80 

Ala Gly Leu Arc Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu. Val 
85 90 95 

Val lie Ala Glu Ser Asp Gly Val Glu Glu Asp Lys Arg Ala Leu Gly 
100 105 110 

Ala 



(2) INFORMATION FOR SEQ ID NO: 110: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: GB215-3-8 

(ix) FEATURE: 

(A) NAME /KEY : CDS 
(3) LOCATION: 2.. 340 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

C TCC ACT GTA ACC GAA AAA GAC ATC AGG GTC GAG GAG GAG GTA TAT 4 5 

Ser Thr Val Thr Glu Lys As? lie Arg Val Glu Glu Glu Val Tyr 
15 10 15 

CAG TGT TGT GAC CTG GAG CCC GAA GCC CGC AAG GTA ATT ACC GCC CTA 94 
Gin Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val lie Thr Ala Leu 
20 25 30 

ACA GAG AGA CTC TAT GTG GGC GOT CCC ATG CAT AAT AGC AAA GGA GAC 14 2 

Thr Glu Arg Leu Tyr Val Gly Gly Pro Met His Asn Ser Lys Gly Asp 
35 40 45 

CTG TGC GGG TAT CGC AGA TGC CGC GCA AGC GGC GTC TAC ACC ACC AGC 190 
Leu Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser 
50 55 60 

TTC GGG AAC ACA CTG ACG TGC TAT CTC AAA GCC TCA GCC GCC ATC AGG 23 3 

Phe Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala lie Arg 
65 70 75 

GCG TCA GGG CTG AGA GAC TGC ACT ATG CTG GTC TAT GGT GAC GAC CTG 2 35 

Ala Ser Gly Leu Arg Asp Cys Thr Met Leu Val Tyr Gly Asp Asp Leu 
80 85 90 95 

GTC GTC ATT GCC GAG AGC GAT GGC GTA GAG GAG GAC AAA CGA GCC CTC 3 34 

Val Val lie Ala Glu Ser Asp Gly Val Glu Glu As? Lys Arg Ala Leu 
100 105 110 



GGA GTC 
Glv Val 



340 



(2) INFORMATION FOR SEQ ID NO: 111: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NC : 111: 

Ser Thr Val Thr Glu Lys Asp lie Arg Val Glu Glu Glu Vai Tyr Gin 
15 10 is 

Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val lie Thr Ala Leu Thr 
20 25 30 

Glu Arg Leu Tyr Vai Gly Gly Pro Met His Asn Ser Lys Gly Asp Leu 
35 40 45 

Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Vai Tyr Thr Thr Ser Phe 
50 55 60 

Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala lie Arg Ala 
65 70 75 8C 

Ser Gly Leu Arg Asp Cys Thr Met Leu Val Tyr Gly Asp Asp Leu Val 
85 90 95 

Val lie Ala Glu Ser Asp Gly Val Glu Glu Asp Lys Arg Ala Leu Gly 
100 105 110 

Val 



(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34C base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : CDNA 
(ili) HYPOTHETICAL : NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: G33 53-3-3 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 2 . . 340 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
C TCC ACT GTA ACC GAA AAG GAC ATC AGG GTC GAG GAG GAG GTG TAT 4 6 
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Ser Thr Val Thr Glu Lys Asp He Arg Val Glu Glu Glu Val Tyr 
1 5 10 15 

CAG TGT TGT GAC CTG GAG CCC GAG GCC CGC AAG GCA ATT ACT GCC CTA 94 
Gin Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Ala He Thr Ala Leu 
20 25 30 

ACA GAG AGA CTC TAT GTG GGC GGT CCC ATG CAT AAC AGC AAG GGA GAC 142 
Thr Glu Arg Leu Tyr Val Gly Gly Pro Met His Asn Ser Lys Gly Asp 
35 40 45 

CTG TGT GGG TAT CGC AGA TGC CGC GCA AGC GGC GTC TAC ACC ACC AGC 190 
Leu Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser 
50 55 SO 

TTC GGG AAC ACA CTG ACG TGC TAC CTC AAA GCC TCA GCC GCT ATC AGA 2 38 

Phe Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala He Arg 
65 70 75 

GCG GCG GGG CTG AGA GAC TGC ACC ATG TTG GTC TGT GGT GAT GAC CTG 2 96 

Ala Ala Gly Leu Arg Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu 
30 ' 8S ' 90 95 

GTC GTC ATC GCT GAG AGC GAT GGC GTT GAG GAG GAC AAA CGA GCC CTC 3 34 

Val Val He Ala Glu Ser Asp Gly Val Glu Glu Asp Lys Arg Ala Leu 
100 105 HO 



GGA GCC 
Gly Ala 



(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 113 amino acids 
(3) TYPE: amine acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

Ser Thr Val Thr Glu Lys Asp He Arg Val Glu Glu Glu Val Tyr Gin 
15 10 15 

Cvs Cys Asp Leu Glu Pro Glu Ala Arg Lys Ala He Thr Ala Leu Thr 
20 25 30 

Glu Arg Leu Tyr Val Gly Gly Pro Met His Asn Ser Lys Gly Asp Leu 
35 40 45 

Cvs Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 
50 55 60 

Gly Asn Thr Leu Thr Cvs Tyr Leu Lys Ala Ser Ala Ala He Arg Ala 
65 70 75 80 



340 
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Ala Gly Leu Arg Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val 

8S 90 " 95 

Val lie Ala Glu Ser Asp Gly Val Glu Glu Asp Lys Arg Ala Leu Gly 

100 105 ~ * 110 

Ala 



(2) INFORMATION FOR SEQ ID tfO : 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(3) CLONE: G3549-3-6 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
(3) LOCATION: 2.. 340 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

C TCC ACG GTG ACC GAA AGG GAT ATC AGG ACC GAG GAA GAG ATC TAC 4 5 

Ser Thr Val Thr Glu Arg Asp He Arg Thr Glu Glu Glu He Tyr 
1 5 10 15 

CAG TGC TGC GAC CTG GAG CCC GAA GCC CGC AAG GTG ATA TCC GCC CTA 94 
Gin Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val He Ser Ala Leu 
20 25 30 

ACG GAA AGA CTC TAC GTG GGC GGT CCC ATG TAC AAC TCC AAG GGG GAC 14 2 

Thr Glu Arg Leu Tyr Val Gly Gly Pro Met Tyr Asn Ser Lys Gly Asp 
3S 40 45 

CTA TGC GGG CAA CGG AGG TGC CGC GCA AGC GGG GTC TAC ACC ACC AGC 190 
Leu Cys Gly Gin Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser 
50 55 60 

TTC GGG AAC ACT GTA ACG TGT TAT CTC AAG GCC GTT GCG GCT ACT AGG 2 38 

Phe Gly Asn Thr Val Thr Cys Tyr Leu Lys Ala Val Ala Ala Thr Arg 
65 70 75 

GCC GCA GGT CTG AAA GGT TGC AGC ATG CTG GTT TGT GGA GAC GAC TTA 28 6 

Ala Ala Gly Leu Lys Gly Cys Ser Met Leu Val Cys Gly Asp Asp Leu 
80 85 90 95 
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GTC GTC ATC TGC GAG AGC GGC GGC GTA GAG GAG GAT GCA AGA GCC CTC 
Val Val lie Cys Glu Ser Gly Gly Val Glu Glu Asp Ala Arg Ala Leu 
100 105 110 



334 



CGA GCC 
Arg Ala 



340 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 113 amino acids 
(S) TYPE: amine acid 
(D) TOPOLOGY: linear 

{ ii) MOLECULE TYPE: procein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

Ser Thr Val Thr Glu Arg Asp He Arg Thr Glu Glu Glu lie Tyr Gin 
15 10 15 

Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val He Ser Ala Leu Thr 
20 25 30 

Glu Arg Leu Tyr Val Gly Gly Pro Met Tyr Asn Ser Lys Gly Asp Leu 
35 40 45 

Cys Gly Gin Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 
50 55 60 

Gly Asn Thr Val Thr Cys Tyr Leu Lys Ala Val Ala Ala Thr Arg Ala 
65 70 75 80 

Ala Gly Leu Lys Gly Cys Ser Met Leu Val Cys Gly Asp Asp Leu Val 
85 90 95 

Val He Cys Glu Ser Gly Gly Val Glu Glu Asp Ala Arg Ala Leu Arg 
100 105 110 

Ala 



(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNES 5 : single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: G3809-3-1 

( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 340 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 : 

C TCC ACT GTG ACT GAG AGA GAC ATC AAG GTC GAA GAA GAA GTC TAT 
Ser Thr Val Thr Glu Arg Asp lie Lys Val Glu Glu Glu Val Ty- 
5 io Is 

CAG TGT TGT GAT CTG GAG CCC GAG GCC CGC AAG GTA ATA GCC GCC CTC 
Gin Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val He Ala Ala Leu 
20 25 30 

ACG GAG AGA CTC TAC GTG GGC GGC CCC ATG CAT AAC AGC AAG GGA GAC 
Thr Glu Arg Leu Tyr Val Gly Gly Pro Met: His Asn Ser Lvs Glv As? 
35 40 45 

CTT TGC GGG TAT CGT AGA TGC CGC GCG AGC GGC GTA TAC ACC ACG AGC 
Leu Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser 



50 55 



60 



100 105 



110 



GGA GCT 
Gly Ala 



46 



94 



142 



L90 



T . GG. AAC ACA ATG ACG TGC TAC CTT AAG GCC TCA GCA GCC ATC AGG 238 
Phe Gly Asr, Thr Met Thr Cys Tyr Leu Lys Ala Ser Ala Ala 1^ Ara 
65 7 0 75 



296 



GCT GCG GGG CTA AAG GAT TGC ACC ATG CTG GTT TGC GGT GAC GAC CTA 
Ala Ala Gly Leu Lys Asp Cys Thr Met Leu Val Cvs Gly Asd Aso Leu 
80 85 90 95 

GTC GTG ATC GCC GAG AGC GGT GGC GTT GAG GAG GAC AAA CGA GCC CTC 334 
Val Val He Ala Glu Ser Gly Gly Val Glu Glu Asp Lys Arg Ala Leu 



340 



(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: proCein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117; 

Ser Thr Val Thr Glu Arg Asp He Lys Val Glu Glu Glu Val Tyr Gin 
15 10 15 
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Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val lie Ala Ala Leu Thr 
20 25 30 

Glu Arg Leu Tyr Val Gly Gly Pro Met His Asn Ser Lys Gly Asp Leu 
35 40 45 

Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 

5 n 5 5 SO 

Gly Asn Thr Mec Thr Cys Tyr Leu Lys Ala Ser Ala Ala He Arg Ala 
65 70 75 80 

Ala Gly Leu Lys Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val 
35 90 95 

Val He Ala Glu Ser Gly Gly Val Glu Glu Asp Lys Arg Ala Leu Gly 
100 105 HO 



Ala 



• ta- 
re I 



(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES3 : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



yi (iii) ANTI -SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(3) CLONE: GB353-4-I 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1 . . 574 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

ACT TGC GGC TTT GCC GAC CTC ATG GGA TAC ATC CCG CTC GTA GGC GCC 4 3 

Thr Cvs Gly Phe Ala Asu Leu Met Gly Tyr He Pro Leu Val Gly Ala 
1*5 10 15 

CCT GTG GGT GGC GTC GCC AGG GCC CTG GCA CAC GGT GTT AGG GCT GTG 96 
p>- 0 va^ Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
20 25 30 



GAG GAC GGG ATC AAT TAT GCG ACA GGG AAT CTT CCC GGT TGC TCT TTC 
Glu Aso Gly lie Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 



144 
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TCT ATC TTC CTC TTG GCA CTT CTT 
Ser lie Phe Leu Leu Ala Leu Leu 
50 55 



180 

TCG TGC 
Ser Cys 



FCT/EP94/01323 



CTG ACT GTT CCC 
Leu Thr Val Pro 
60 



ACC TCG 
Thr Ser 



192 



GCC GTC AAC TAT CGC AAT GCC TCG GGC ATC 
Ala Val Asn Tyr Arg Asn Ala Ser Gly He 
65 70 



TAT CAC ATC ACC 
Tyr His He Thr 
75 



AAT GAC 
Asn Asp 
80 



240 



TGC CCG AAC TCG 
Cys Pro. Asn Ser 



AGC 
Ser 
85 



ATA 

He 



GTG 
Val 



TAC 
Tyr 



GAG ACC 
Glu Thr 
90 



GAG CAC CAC ATC 
Glu His His Xle 



CTA CAC 
Leu His 
95 



233 



CTC CCA GGG TGT TTA CCC TGC GTG AGG GTT GGG AAT CAG TCA 
Leu Pro Gly Cys Leu Pro Cys Val Arg Val Gly Asn Gin Ser 
100 105 no 



CGC TGC 
Arg Cys 



336 



TGG GTG GCC CTC ACT CCC ACC 
Trp Val Ala Leu Thr Pro Thr 
115 



GTG 
Val 
120 



GCG GCG 
Ala Ala 



CCT TAC 
Pro Tyr 



ATC GGC 
He Gly 
125 



GCT CCG 
Ala Pro 



334 



CTT GAA TCC CTC CGG AGT CAT GTG GAT CTG ATG GTA GGT GCC GCT ACT 
Leu Glu Ser Leu Arg Ser His Val Asp Leu Met Val Gly Ala Ala Thr 
130 135 140 



43: 



GCG TGC TCC GCT CTT TAC- ATC GGA GAC CTG TGC GGT GGC GTA TTC TTG 
Ala Cys Ser Ala Leu Tyr He Gly Asp Leu Cys Gly Gly Val Phe Leu 
145 150 155 160 



430 



GTT GGT CAG ATG 
Val Glv Gin Met: 



T*"7C 

Phe 
155 



TCT 
Ser 



TTC 
Phe 



CAG 
Gin 



CCG CGG 
Pro Arg 
170 



CGC CAC 
Arg His 



TGG ACT 
Trp Thr 



ACG CAG 
Thr Gin 
175 



528 



GAC TGC AAT TGT TCC ATC TAC GCG GGG CAC 
Asp Cys Asn Cys Ser lie Tyr Ala Gly His 
130 185 



GTT ACG GGC CAC AGG A 
Val Thr Gly His Arg 
190 



574 



(2) INFORMATION FOR SEQ ID NO : 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala 
15 10 15 

Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
20 25 30 

Glu Asp Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

Ser lie Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 
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SO 



55 



60 



Ala Val Asn Tyr Arg Asn Ala Ser Gly lie Tyr His He Thr 



65 



70 



75 



Asn Asp 
80 



Cys Pro Asn Ser Ser He Val Tyr Glu Thr Glu His His He Leu His 
35 90 95 

Leu Pro Gly Cys Leu Pro Cys Val Arg Val Gly Asn Gin Ser Arg Cvs 
100 1Q5 



110 



rrp Vai Ala Leu Thr Pro Thr Val Ala Al« 



La Pro Tyr He Giy Ala Pro 
li5 120 12 5 

Leu Glu Ser Leu Arg Ser His Val Asp Leu Met Val Gly Ala Ala Thr 

130 135 140 

Ala Cys Ser Ala Leu Tyr He Gly Asp Leu Cvs Gly Gly Val Phe Leu 
145 ^ 155 i 60 

Val Gly Gin Mec Phe Ser Phe Gin Pro Arg Arg His Trp Thr Thr Gin 
"5 170 * 173 

Asp Cys Asn Cys Ser lie Tyr Ala Gly His Vai Thr Giy His Ar~ 
2-30 185 19Q 



(2) INFORMATION PC?. SEQ ID NO : 120: 



IJ1 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: G3543-4-3 

(ix) FEATURE: 

(A) NAME /KEY: CDS 
(3) LOCATION: 1 . . 574 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 12( 



ACG TGC GGC TTT GCC GAC CTC ATG GGA TAC ATC CCG CTC GTG GGC GCC 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala 

1 5 10 15 

CCT GTG GGT GGC GTC GCC AGG GCC TTG GCA CAT GGT GTC AGG GCC GTG 

Pro Val Gly Giy Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 

20 25 30 



43 



95 
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GAG GAC GGG ATT AAC TAT 
Glu Asp Gly lie Asn Tyr 
35 

TCT ATC TTC CTT CTA GCA 
Ser lie Phe Leu Leu Ala 
50 

GCG CAG CAC TAC CGG AAC 
Ala Gin His Tyr Arg Asn 
65 70 

TGC CCG AAC TCT AGT ATA 

Cys Pro Asn Ser Ser lie 
35 

CTA CCA GGG TGT GTG CCT 

Leu Pro Gly Cys Val Pro 
100 

TGG GTT CCT TTA ACA CCC 

Trp Val Pro Leu Thr Pro 
115 



GCA ACA GGG AAT CTT CCC GGT TGC 
Ala Thr Gly Asn Leu Pro Gly Cys 

40 45 
CTT CTC TCG TGC TTG ACT GTC CCG 
Leu Leu Ser Cys Leu Thr Val Pro 
55 60 

ATC TCG GGC ATT TAT CAC GTC ACC 
He Ser Gly He Tyr His Val Thr 
75 

GTG TAT GAA GCT GAC CAT CAT ATC 
Val Tyr Glu Ala Asp Kis His He 
90 

TGC GTG AGA ACC GGG AAC ACC TCG 
Cys Val Arg Thr Gly Asn Thr Ser 
105 no 



TCC TTT 
Ser Phe 

GCC TCG 
Ala Ser 



AAT GAC 
Asn Asp 
80 

ATG CAT 
Met His 
95 

CGC TGC 
Arg Cys 



ACT GTG GCT GCC CCC TAT GTT GGC GCG CCG 
Thr Val Ala Ala Pro Tyr Val Gly Ala Pro 

120 125 



144 



192 



240 



288 



336 



38- 



CTC GAA TCC ATG CGG CGG 
Leu Glu Ser Met Arg Arg 
130 

GTC TGC TCG GCC CTG TAC 
Val Cys Ser Ala Leu Tyr 
145 150 

GTC GGG CAG ATG TTC ACC 
Val Gly Gin Met Phe Thr 
165 

GAC TGC AAC TGC TCT ATC 
Asp Cys Asn Cys Ser He 
180 



CAC GTG GAC TTA ATG GTG GGT GCC GCC ACC 
His Val As? Leu Met Val Gly Ala Ala Thr 
135 140 

ATC GGA GAC CTT TGC GGA GGT GTC TTC CTG 
He Gly Asp Leu Cys Gly Gly Val Phe Leu 
155 160 

TTC CGG CCG CGC CGC CAT TGG ACT ACC CAG 
Phe Arg Pro Arg Arg His Trp Thr Thr Gin 
170 175 

TAT GAT GGC CAC ATC ACC GGC CAT AGA A 
Tyr Asp Gly His He Thr Gly Kis Arg 
185 190 



431 



480 



523 



574 



(2) INFORMATION FOR SEQ ID NO : 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala 
15 10 15 

Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
20 25 " 30 
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Glu Asp Gly lie Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
->c 40 4S 



Ser lie Phe Leu Leu Ala 



Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 



SO 



55 



SO 



Ala Gin His Tyr Arg Asn He Ser Gly lie Tyr Hxs Val Thr Asn Asp 
65 ™ 7S 80 



Cys Pro Asn Ser Se 



r He Val Tyr Glu Ala As? His His lie Met His 



35 



90 



95 



Leu Pro Gly Cys Val Pro Cys Val Arg Thr Gly Asn Thr Ser Arg Cys 

105 HO 



ICC 



Trp Val Pro Leu Thr Pro Thr Val Ala Ala Pro Tyr Val Gly Ala Pro 
e 12 5 



115 



120 



Leu Glu Ser Met Arg Arg His Val Asp Le 

130 135 140 

Val Cys Se 
145 

Va 



u Me- Val Gly Ala Ala Thr 



r Ala Leu Tyr He Gly Asp Leu Cys Gly Gly Val Phe Leu 



150 



155 



i Gly Gin Mec Phe Thr Phe Arg Pro Arg Arg H-S 
1S5 



160 

Trp Thr Thr Gin 
175 



Glv His lis Thr Gly His Arg 



Aso Cys Asn Cys Ser lie Tyr Asp Gly 

130 185 190 



(2) INFORMATION FOR SEQ ID NO : 12 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(3) CLONE: GBS09-4-3 

( ix) FEATURE : 

(A) NAME / KEY : CDS 

(B) LOCATION: 1 . -574 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

_ rr^ *** rrn CTC GTG GGC GCC 

ACG TGC GGC TTC GCC GAC CTC AiG GGA T«v. A-*- 
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Thr Cys Gly Phe Ala 
1 5 

CCC GTT GGG GGC GTC 
Pro Val Gly Gly Val 

20 

GAG GAC GGG ATT AAC 
Glu Asp Gly He Asn 
35 

TCT ATC TTC CTC CTG 
Ser lie Phe Leu Leu 

50 

GCT GAG CAC TAC CGG 
Ala Glu His Tyr Arg 

TGT CCG AAT TCC 
Cys Pro Asn Ser 
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TTG CCG 
Leu Pro 

TGG ACG 
Tro Thr 



CTC GAG 
Leu Glu 
130 

GTG TGT 
Val Cys 
145 

GTG GGG 
Val Gly 



GGG TGC 
Gly Cys 
100 
CCG GTA 
Pro Val 
115 

TCC TTC 
Ser Phe 



TCT GTC 
Ser Val 



CAG ATG 
Gin Met 



GAT TGT AAT TGC 
Asp Cys Asn Cys 
180 



AGC 
Ser 
35 

GTA 
Val 

ACA 



CGG 
Arg 



CTC 
Leu 



Phe 
165 

TCC 
Ser 



Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala 
10 15 

GCC AGG GCC CTG GCG CAT GGC GTC AGG GCT GTG 
Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
25 30 

TAT GCG ACA GGG AAT CTT CCC GGT TGC TCT TTC 
Tyr Ala Thr Gly Asn Leu Pro Gly Cys Se- 
40 45 

GCA CTT CTT TCG TGC CTC ACT GTC CCA GCG TCA 
Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 

AAT GCT TCG GGC ATC TAT CAC ATC ACC AAT GAC 
Asn Ala Ser Gly He Tyr His lie Thr Asn As- 
70 75 go 

GTA GTC TAT GAA ACT GAC CAC CAT ATA TTG CAC 
Val Val Tyr Glu Thr Asp His His lie Leu His 
90 95 

CCC TGC GTG AGG GCC GGG AAC GTG TCT CGT TGC 
Pre Cys Val Arg Ala Gly Asn Val Ser Arg Cvs 

CCT ACG GTG GCT GCC GTA TCC ATG GAC GCT CCG 
Pro Thr Val Ala Ala Val Ser Met Aso Ala Pro 
120 X25 

CGG CAT GTG GAC CTA ATG GTA GGT GCG GCC ACC 
Arg His Val Asp Leu Met Val Gly Ala Ala Th- 
135 140 

TAT GTT GGA GAC CTC TGT GGA GGT GCT TTC CTA 
Tyr Val Gly Asp Leu Cys Glv Gly Ala Phe Leu 
150 155 160 

ACC TTC CAG CCG CGT CGC CAC TGG ACC ACG CAG 
Thr Phe Gin Pro Arg Arg His Trp Thr Thr Gin 
170 ' its 

ATC TAT ACT GGC CAT ATC ACC GGC CAC AGG A 
lie Tyr Thr Gly His lie Thr Gly His Arg 
185 i9o 



96 



144 



192 



240 



28S 



336 



384 



432 



430 



52S 



57^ 



(2) INFORMATION FOR SEQ ID NO: 123: 

{ l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 3: 
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Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala 
15 10 is 

Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
20 25 30 

Glu Asp Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

Ser lie Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 55 60 

Ala Glu His Tyr Arg Asn Ala Ser Gly He Tyr His He Thr Asn Asp 
S5 70 75 80 

Cys Pro Asn Ser Ser Val Val Tyr Glu Thr Asp His His He Leu His 
85 90 95 

Leu Pro Gly Cys Val Pro Cys Val Arg Ala Gly Asn Val Ser Arg Cys 
100 105 110 

Trp Thr Pro Val Thr Pro Thr Val Ala Ala Val Ser Met Asp Ala Pro 
115 120 125 

Leu Glu Ser Phe Arg Arg His Val Asp Leu Met Val Gly Ala Ala Thr 
130 135 140 

Val Cys Ser Val Leu Tyr Val Gly Asp Leu Cys Gly Gly Ala Phe Leu 
145 150 155 160 

Val Gly Gin Met Phe Thr Phe Gin Pro Arg Arg His Trp Thr Thr Gin 
165 170 175 

Asp Cys Asn Cys Ser lie Tyr Thr Gly His lie Thr Gly His Arg 
130 185 190 

(2) INFORMATION FOR SHQ ID NO: 124: 
(i) SEQUENCE CHARACTERISTICS: 
(A} LENGTH : 31 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDSDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 
(3) LOCATION: 1 . . 31 

(D) OTHER INFORMATIONS /s tandard_name= "HO/ Primer HC?r206" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 124: 
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TGGGGATCCC GTATGATACC CGC7GCTTTG A 



31 



(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 1. .30 

(D) OTHER INFORMATION: /standard_name= "HCV Primer Hc?r207 " 
(xi) SEQUENCE DESCRIPTION: SEQ ID NC : 125: 
GGCGGAATTC CTGGTCATAG CCTCCGTGAA 3 C 



(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
(3) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: amino acid 

(C) INDIVIDUAL ISOLATE: GB353 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 12 5: 

Val Asn Tyr Arg Asn Ala Ser Gly lie Tyr His lie 
15 10 
(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(iii) HYPOTHETICAL: NC 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Amino acid 

(C) INDIVIDUAL ISOLATE: GBS4 9 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

Gin His Tyr Arg Asn lie Ser Gly lie Tyr His Val 
IS 10 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 amino acids 
(3) TYPE: amine acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : pec ride 

(iii)- HYPOTHETICAL : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Anmo acid 

(C) INDIVIDUAL ISOLATE: G33 0 9 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

Glu His Tyr Arg Asn Ala Ser Gly He Tyr His He 
15 10 

(2) INFORMATION FOR SEQ ID NO: 12 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 amino acids 

(B) TYPE: amine acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NC 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: amino acid 

(C) INDIVIDUAL ISOLATE: G33 5 9 



(xi) SEQUENCE DESCRIPTION: SZ2 ID NO: 129: 

Val Tyr Glu Thr Glu His His lie Leu His Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 13 0: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: II amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : pepcide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: amino acid 

(C) INDIVIDUAL ISOLATE: GB549 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 0: 

Val Tyr Glu Ala Asp His His lie Met His Leu 
1 5 io 

{2) INFORMATION FOR SEQ ID NO : 131: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 ammo acids 
(3) TYPE: amine acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: amino acid 

(C) INDIVIDUAL ISOLATE: G3809 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 131: 
Val Tyr Glu Thr Asp His His He Leu His Leu 



(2) INFORMATION FOR SEQ ID NO : 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NC 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: amino acid 

(C) INDIVIDUAL ISOLATE: G3358 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
Val Arg Val Gly Asn Gin Ser Arg Cys Trp Val Ala Leu 



(2) INFORMATION FOR SEQ ID NO : 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNE3S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

{A} ORGANISM: amino acid 

(C) INDIVIDUAL ISOLATE: GB54 9 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

Val Arg Thr Gly Asn Thr Ser Arg Cys Trp Val Pro Leu 
15 10 

(2) INFORMATION FOR SEC ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
(3) TYPE: amine acid 

(C) STRANDEDNE53 : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: amino acid 

(C) INDIVIDUAL ISOLATE : GB809 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

Val Arg Ala Gly Asr. Val Ser Arg Cys Trp Thr Pro Val 
15 10 

(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
(3) TYPE : amino acid 

(C) STRANDEDNE3 3 : single 

(D) TOPOLOGY: linear 



1 



5 



10 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: amino acid 

(C) INDIVIDUAL ISOLATE: GB353 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135 : 

Ala Pro Tyr lie Gly Ala Pro Leu Glu Ser 
15 io 

(2) INFORMATION FOR SEQ ID NO : 13 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 
(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: amino acid 

(C) INDIVIDUAL ISOLATE: G3549 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Ala Pro Tyr Val Gly Ala Pro Leu Glu Ser 
15 io 

(2) INFORMATION FOR SEQ ID NO : 13 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: amino acid 

(C) INDIVIDUAL ISOLATE: G3809 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 7: 

Ala Val Ser Met Asp Ala Pro Leu Glu Ser 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 138: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MCLECULE TYPE: pepcide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: amino acid 

(C) INDIVIDUAL ISOLATE: G3353 and GB309 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

Gin Pro Arg Arg His Trp Thr Thr Gin Asp 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 13 9: 

{ i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 10 amino acids 
(3) TYPE: amino acid 

(C) STRAND EDNE3S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: amino acid 

(C) INDIVIDUAL ISOLATE: G3549 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 9: 
Arg Pro Arg Arg His Trp Thr Thr Gin Asp 



(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNZSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 
(iii) HYPOTHETICAL : NC 
(vi) ORIGINAL SOURCE : 



1 



5 



10 
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(A) ORGANISM: amino acid 

(C) INDIVIDUAL ISOLATE: GB54 9 



£xi} SEQUENCE DESCRIPTION: SEQ ID NO: 140 : 

Arg Pro Arg Arg His Trp Thr Thr Gin Asp 
1 5 10 



(2) INFORMATION FOR SEQ ID NO : 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
Ciii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 
TGGGATATGA TGATGAACTG GTC 
(2) INFORMATION FOR SEQ ID NO: 142: 

£i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 4 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL : NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 
CCAGGTACAA CCGAACCAAT TGCC 
(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 957 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .957 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1. .954 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

ATG AGO ACA AAT CCT AAA CCT CAA AGA AAA ACC AAA AGA AAC ACT AAC 4 3 

Mec Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
1 5 10 15 

CGC CGC CCA CAG GAC GTC AAG TTC CCG GGC GGT GGC CAG ATC GTT GGT 95 
A-g Arg P-o Gin Aso Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 
20 ~ 25 30 

GGA GTA TAC TTG TTG CCG CGC AGG GGC CCC CGG TTG GGT GTG CGC GCG 144 
Glv Val Tvr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

ACG AGG AAA ACT TCC GAG CGG TCC CAG CCA CGT GGG AGG CGC CAG CCC 192 
Thr Arg Lvs Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 * 55 60 

A^C CCC AAA GAT CGG CGC CCC ACT GGC AAG TCC TGG GGA AAA CCA GGA 24 0 

lie Pro Lys Aso Arc Arg Pro Thr Gly Lys Ser Trp Gly Lys Pro Gly 
65 ~ ^ 70 75 B0 

TAC CCT TGG CCC CTG TAC GGG AAT GAG GGC CTC GGC TGG GCA GGG TGG 28 3 

Tyr Pro Trr> Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 
35 90 95 

CTC CTG TCC CCC CGA GGG TCT CGC CCG TCA TGG GGC CCA ACT GAC CCC 33 6 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr As? Pro 
100 105 HO 



CGG CAC AGG TCA CGC AAC TTG GGT AAG GTC ATC GAT ACC CTT ACG TGT 
A-g His Arg Ser Arg Asn Leu Gly Lys Val lie Asp Thr Leu Thr Cys 
115 120 125 

GGC TTT GCC GAC CTC ATG GGG TAC ATC CCT GTC GTC GGC GCC CCA GTT 
Gly Phe Ala Aso Leu Me: Gly Tyr lie Pro Val Val Gly Ala Pro Val 
130 135 140 

GGT GGT GTC GCC AGA GCT CTC GCG CAT GGC GTG AGA GTT CTG GAA GAC 
Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu As? 
145 150 155 ISO 



334 



432 



480 
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AAC TAT 
Asn Tyr 



TTC TTA 
Phe Leu 



CAG GTC 
Gin Val 



AAC AGT 
Asn Ser 
210 

GGG TGT 
Gly Cys 
225 

CCG GTT 
Pro Val 



TTG GCC 
Leu Ala 
180 

AAG AAC 
Lys Asn 
195 

AGC ATC 
Ser lie 



GCA 
Ala 
165 

CTG 
Leu 



ACC 
Thr 



GTC 
Val 



ACA 
Thr 



CTA 
Leu 



AGC 
Ser 



TGG 



GGG AAC TTG CCC GGT TGC 
Gly Asn Leu Pro Gly Cys 
170 

TCT TGT ATC ACT GTG CCG 
Ser Cys lie Thr Val Pro 
135 

TAC ATG GTA ACC 
Tyr Met Val Thr 



GTC CCT 
Val Pro 



TCG CCC- 
Ser Pro 



TGT 
Cys 



AAT 
Asn 
245 



GAG 
Glu 
230 

ATA 



AGC TCT 
Ser Ser 
200 

CAG CTC 
Gin Leu 
215 

GAG AAG 
Glu Lys 



GCT GTG 
A^ a Va 1 



GGC CTG 
Gly Leu 



TCT GCC 
Ser Ala 



CAA GTC 
Gin Val 
290 

AAC TGT 
Asn Cys 
305 



CGG ACS 
Arg Thr 
2S0 

CTG TAC 
Leu Tyr 
275 

TTC ATC 
Phe lie 



TCC ATA 
Ser lie 



CAT ATT GAT ACC 
His lie Asp Thr 



ATA 
lie 



ATC 
lie 



TAC 
Tvr 



GGA 
Gly 



TCG 
Ser 



CCA 
Pro 
310 



GAC CTG 
Asp Leu 
230 

CCC CAG 
Pro Gin 
295 

GGC CAC 
Gly Kis 



AGG GAT GCT GTT 
Arg Asp Ala Val 
220 

GGC AAC ATA TCC 
Gly Asn lie Ser 
235 

AGC CAA CCT GGT 
Ser Gin Pro Gly 
250 

ATC ATT GCA TCC 
lie lie Ala Ser 
265 

TGT GGC GCG GTG 
Cys Gly Ala Val 



TCC TTT TCT ATC 
Ser Phe Ser lie 
175 

GTC TCC GGC TTG 
Val Ser Gly Leu 
190 

AAT GAC TGC CAG 
Asn Asp Cys Gin 
205 

CTT CAC GTC CCC 
Leu His Val Pro 



CGC TGT 
Arg Cys 



GCG CTT 
Ala Leu 



GCT ACG 
Ala Thr 
270 



CAT CAT AAG TTT 
His His Lys Phe 
300 

ATC ACT GGA CAT 
lie Thr Gly His 
315 



ATG TTG 
Met Leu 
295 

GTC CAG 
Val Gin 



TGG ATA 
Trp lie 
240 

ACC AAG 
Thr Lys 
255 

TTT TGC 
Phe Cys 



GCT TCT 
Ala Ser 



GAC TGC 
Asp Cys 



CGG ATG GCG 
Arg Met Ala 



S28 



575 



624 



672 



720 



76^ 



916 



912 



95" 



(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 319 amino acids 
(3) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 144: 

Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
10 15 

Lys Phe Pro Gly Gly Gly Gin lie Val Gly 
25 * 30 



Met Ser Thr Asn Pro Lys 
1 5 



Arg Arg Pro Gin Asp Val 
20 
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Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

He Pro Lys Asp Arg Arg Pre Thr Gly Lys Ser Trp Gly Lys Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 
100 105 110 

Arg His Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr He Pro Val Val Gly Ala Pro Val 
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys He Thr Val Pro Val Ser Gly Leu 
130 135 190 

Gin Val Lys Asn Thr Ser Ser Ser Tyr Met Val Thr Asn Asp Cys Gin 
195 200 205 

Asn Ser Ser He Val Trp Gin Leu Arg Asp Ala Val Leu His Val Pro 
210 215 220 

Gly Cys Val Pro Cys Glu Glu Lys Gly Asn He Ser Arg Cys Trp He 
225 230 235 240 

Pro Val Ser Pro Asn He Ala Val Ser Gin Pro Gly Ala Leu Thr Lys 
245 250 255 

Gly Leu Arg Thr His He Asp Thr He He Ala Ser Ala Thr Phe Cys 
2S0 2S5 270 

Ser Ala Leu Tyr He Gly Asp Leu Cys Gly Ala Val Met Leu Ala Ser 
275 230 285 

Gin Val Phe He He Ser Pro Gin His His Lys Phe Val Gin Asp Cys 
290 295 300 

Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg Met Ala 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 1-15: 

( i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME / KEY : mat _pep tide 

{B} LOCATION: 2.. 337 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 2.. 340 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 5 : 

C TCA ACG GTC ACS GAG AGG GAC A7C AGA ACT GAG GAG TCC ATA TAC 4 6 

Ser Thr Val Thr Glu Arg Asp He Arg Thr Glu Giu Ser He Tyr 
15 10 15 

CTT GCT TGC TCT TTA CCC GAG CAG GCA CGG ACT GCC ATA CAC TCA CTG 94 
Leu Ala Cys Ser Leu Pro Giu Gin Ala Arg Thr Ala lie Kis Ser Leu 
20 25 30 

ACT GAG AGG CTT TAC GTG GGA GGG CCC ATG CTA AAC AGO AAA GGG CAA 14 2 

Thr Giu Arg Leu Tyr Val Gly Giy Pro Met Leu Asn Ser Lys Giy Gin 
35 40 45 

ACT TGC GGA TAC AGA CGC TGC CGC GCC AGC GGA GTG TTC ACC ACT AGC 190 
Thr Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Phe Thr Thr Ser 
50 55 60 

ATG GGA AAT ACC ATC ACG TGC TAC GTG AAG GCA CAA GCA GCC TGT AAG 2 38 

Met Giy Asn Thr He Thr Cys Tyr Val Lys Ala Gin Ala Ala Cys Lys 
65 70 75 

GCT GCG GGC ATA ATT GCC CCC ACG ATG CTG GTG TGC GGC GAC GAT CTA 286 
Ala Ala Gly He He Ala Pro Thr Met Leu Val Cys Gly Asp Asp Leu 
80 85 90 95 

GTT GTC ATC TCA GAG AGT CAG GGG ACC GAG GAG GAC GAG CGG AAC CTA 3 34 

Val Val He Ser Glu Ser Gin Gly Thr Glu Glu Asp Glu Arg Asn 'Leu 
100 105 110 

CGA GCC 340 
Arg Ala 



(2) INFORMATION FOR SEQ ID NO: 146: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

Ser Thr Val Thr Glu Arg Asp lie Arg Thr Glu Glu Ser lie Tyr Leu 
IS 10 is 

Ala Cys Ser Leu Pro Glu Gin Ala Arg Thr Ala lie His Ser Leu Thr 
20 2S 30 

Glu Arg Leu Tyr Val Gly Gly Pro Met: Leu Asn Ser Lys Gly Gin Thr 
35 40 45 

Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Phe Thr Thr Ser Met 
50 55 60 

Gly Asn Thr lie Thr Cys Tyr Val Lys Ala Gin Ala Ala Cys Lys Ala 
65 70 75 80 

Ala Gly lie lie Ala Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val 
85 90 95 

Val lie Ser Glu Ser Gin Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg 
100 105 110 

Ala 

(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1. . 345 

(ix) FEATURE : 

(A) NAME / KEY : mat_peptide 
(3) LOCATION: 1 . . 342 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

M> C T 

ATG AGC ACA CTT CCT AAA CCA CAA AGA AAA ACC AAA AGA AAC ACC AAC 4 8 
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Met Ser 
1 




Thr Leu 



Pro 
S 



Lys Pro Gin 



198 



Arg Lys Thr Lys 
10 



PCT/EP94/01323 



Arg Asn Thr Asn 
15 



CCC GGC CAC AGG ACG TTA AGT TCC CAG GCG GCG GTC 
Pro Gly His Arg Thr Leu Ser Ser Gin Ala Ala Val 
20 25 



AGA TCG TTG GTG 
Arg Ser Leu Val 
30 



96 



GAG 
Glu Phe 



Thr Cys 
35 



TAG 
Tyr 



CAC 
His 



GCA GGG 
Ala Gly 

40 



GCC CCC AGT TGG 
Ala Pro Ser Trp 



GTG TGC GTG CAG 
Val Cys Val Gin 
45 



144 



TGC GCA 
Cys Ala 
50 



AGA CTT 
Arg Leu 



CCG 
Pro 



AGC 
Ser 



GGT CGC 
Gly Arg 
55 



AAC CTC GCA GTA 
Asn Leu Ala Val 

60 



GGC GCC AAC CCA 
Gly Ala Asn Pro 



192 



TCC CCA 
Ser Pro 
65 



GGG CGC 
Gly Arg 



GCC 
Ala 



GAA 
Glu 
70 



CCG AGG 
Pro Arg 



GCA GGT CCT GGG 
Ala Gly Pro Gly 
75 



CTC AGC CCG GGT 
Leu Ser Pro Gly 
80 



240 



ACC CTT 
Thr Leu 



GGC CCC 
Gly Pro 



TAT 
Tyr 
85 



ATG 
Met 



GGA ATG 
Gly Met 



AGG GCT GCG GGT 
Arg Ala Ala Gly 
90 



GGG CAG GGT GGC 
Gly Gin Gly Gly 
95 



2 8 S 



TCC TGT CCC CGC GCG GCT CTC GCC 
Ser Cys Pro Arg Ala Ala Leu Ala 
100 



CGT CGT GGG GCC 
Arg Arg Gly Ala 
1C5 



CAA ATG ACC CCC 
Gin Met Thr Pro 
110 



33: 



GGC GCA GGA 
Gly Ala Gly 
115 



345 



(2) INFORMATION FOR SEQ ID NO: 14 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 115 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 148: 
Met Ser Thr Leu Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

Pro Gly His Arg Thr Leu Ser Ser Gin Ala Ala Val Arg Ser Leu Val 
20 25 30 

Glu Phe Thr Cys Tyr His Ala Gly Ala Pro Ser Trp Val Cys Val Gin 
35 40 45 

Cys Ala Arg Leu Pro Ser Gly Arg Asn Leu Ala Val Gly Ala Asn Pro 
50 55 60 

Ser Pro Gly Arg Ala Glu Pro Arg Ala Gly Pro Gly Leu Ser Pro Gly 
65 70 75 80 

Thr Leu Gly Pro Tyr Met Gly Met Arg Ala Ala Gly Gly Gin Gly Gly 
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85 90 95 

Ser Cys Pro Arg Ala Ala Leu Ala Arg Arg Gly Ala Gin Met Thr Pro 
100 105 110 

Gly Ala Gly 
115 

(2) INFORMATION FOR SEQ XD NO : 14 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE3S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 
(3) LOCATION: 2 . .230 

(ix) FEATURE: 

(A) NAME / KEY : mac_pe?cide 
(3) LOCATION: 2. .277 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 9: 

G GCC TGT GAC CTC AAG GAC GAG GCT AGG AGG GTG ATA ACT TCA CTC 4 6 

Ala Cys Asp Leu Lys Asp Giu Ala Arg Arg Val lie Thr Ser Leu 
1*5 10 15 

ACG GAG CGG CTT TAC TGT GGT GGT CCT ATG TTC AAC AGC AAG GGA CAA 94 
Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly Gin 
20 25 30 

CAC TGC GGT TAC CGC CGC TGC CGT GCT AGT GGG GTG CTA CCC ACC AGC 14 2 

His Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr Ser 
35 40 45 

TTC GGG AAC ACA ATC ACC TGT TAC ATC AAA GCA AAG GCA GCT ACC AAA 190 
Phe Gly Asn Thr He Thr Cys Tyr He Lys Ala Lys Ala Ala Thr Lys 
50 55 60 

GCT GCC GGA ATT AAA AAT CCA TCA TTC CTT GTC TGC GGA GAT GAC TTG 23 8 

Ala Ala Gly He Lys Asn Pro Ser Phe Leu Val Cys Gly Asp Asp- Leu 
65 * 70 75 

GTC GTG ATT GCT GAG AGT GCA GGG ATC GAT GAG GAC AGA GCG 280 
Val Val lie Ala Glu Ser Ala Gly He Asp Giu Asp Arg Ala 
80 85 90 

(2) INFORMATION FOR SEQ ID NO: 150: 

{ i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 3 amino acids 
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(B) TYPE: amino acid 
{ D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: procein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 



Ala Cys Asp Leu Lys Asp Glu Ala Arg Arg Val He Thr Ser Leu Th^ 

10 15 



Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly Gin His 
20 2 5 30 

Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr Ser Phe 
35 40 45 

Gly Asn Thr lie Thr Cys Tyr lie Lys Ala Lys Ala Ala Thr Lys Ala 
50 55 60 

Ala Gly lie Lys Asn Pro Ser Phe Leu Val Cys Gly Aso Aso Leu Val 

Val lie Ala Glu Ser Ala Gly lie Asp Glu Asp Arg Ala 
85 90 

(2) INFORMATION FOR SEQ ID NO: 151: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 499 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(3) LOCATION: 1 . .499 

(ix) FEATURE: 

(A) NAME / KEY : ma t_pept ide 

(3) LOCATION: 1..496* 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 151: 

ATG AGO ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA AGA AAC ACC AAC 
Mec Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
1 5 10 



15 



48 



CGT CGC CCA CAG GAC GTC AAG TTC CCG GGC GOT GGT CAG ATC GTT GGC 96 
Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 
20 25 30 
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GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGG ATG GGT GTG CGC GCG 
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Met Gly Val Arg Ala 
35 40 45 



144 



ACT 
Thr 



CGG AAG 
Arg Lys 

30 



ACT TCG GAA CGG TCG CAA CCC CGT 
Thr Ser Glu Arg Ser Gin Pro Arg 



GGA CGG CGT 
Gly Arg Arg 

SO 



CAG CCT 
Gin Pro 



192 



ATT CCC AAG GCG CGC CAG CCC ACG GGC CGG TCC TGG GGT CAA CCC GGG 
lie Pro Lys Ala Arg Gin Pro Thr Gly Arg Ser Trp Gly Gin Pro Gly 
65 70 73 30 



240 



TAC 
Tvr 



CCT TGG 
Pro Tro 



CCC CTT 
Pro Leu 
85 



GCC AAT GAG 
Tvr Ala Asn Glu 



GGC CTC 
Gly Leu 
90 



GGG TGG GCA 
Gly Trp Ala 



GGG TGG 
Gly Trp 
95 



233 



CTG CTC TCC CCT CGA GGC TCT CGG CCT AAT TGG GGC CCC AAT GAC CCC 
Leu Leu Ser Pro Arg Gly Ser Arg Pro Asn Trp Gly Pro Asn Asp Pro 
100 105 110 



336 



CGG CGA AAA TCG CGT AAT TTG GGT AAG 
Arg Arg Lys Ser Arg Asn Leu Gly Lys 
115 120 



GTC ATC 
Val lie 



GAT ACC CTA 
Asp Thr Leu 
125 



ACG TGC 
Thr Cvs 



384 



GGA 

Glv 



TTC GCC 
Phe Ala 
130 



GAT CTC ATG GGG TAT ATC CCG CTC 
Asp Leu Met Gly Tyr lie Pro Leu 



GTA GGC GGC 
Val Gly Gly 
140 



CCC ATT 
Pro lie 



432 



GGG GGC GTC GCA AGG GCT CTC GCA CAC GGT GTG AGG GTC CTT GAG GAC 
Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 



430 



GGG 
Gly 



GTA AAC 
Val Asn 



TAT GCA 
Tyr Ala 
165 



ACA G 
Thr 



499 



(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 16 6 amino acids 
(3) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

Met: Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Met Gly Val Arg Ala 
35 40 45 
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Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 so 

He Pro Lys Ala Arg Gin Pro Thr Gly Arg Ser Trp Gly Gin Pro Gly 
65 7 0 75 ao 

Tyr Pro Trp Pro Leu Tyr Ala Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Asn Trp Gly Pro Asn Asp Pro 
100 105 110 

Arg Arg Lys Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 
115 120 X25 

Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Gly Pro He 

130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Aso 

145 150 155 160 

Gly Val Asn Tyr Ala Thr 
165 

(2) INFORMATION FOR SEQ ID NC : 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 
(3) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A} NAME /KEY : CDS 

(3) LOCATION: 1 . .579 

(ix) FEATURE: 

(A) NAME / KEY : mat^peptide 

(3) LOCATION: 1. .576 



{xi} SEQUENCE DESCRIPTION : SEQ ID NO: 153: 

ACG TGC GGA TTC GCC GAT CTC ATG GGG TAC ATC CCG CTC GTA GGC GGC 4S 
Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Leu Val Gly Gly 
1 5 10 15 

CCC GTT GGG GGC GTC GCA AGG GCT~ CTC GCA CAC GGT GTG AGG GTC CTT 96 
Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu 
20 25 30 

GAG GAC GGG GTA AAC TAT CCA ACA GGG AAT TTA CCC GGT TGC TCT TTC 14 4 
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Olu Asp Gly Val Asn Tyr Pro Thr Gly Asn Leu Pro Gly Cys Ser Phe 



140 
130 



CTC TGC TCC GCG TTA TAC GTA GGA GAG GCG TG. G~ G~ C ~ — 
Leu Cvs Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly -a Le- .... ^ 
145 150 15= 

GTA GGC CAA ATG TTC ACC TAT AGG CCT CGC CAC- CAC GCT ACS GTG GAG 
Val Gly Gin Met Phe Thr Tyr Arg Pro Arg Gin H,s A,a Th. Va- 



las 190 
130 13b 



GCG 
Ala 



(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH : 15 3 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 
Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Gly 

T 5 



192 



240 



40 45 
35 4U 

TCT ATC TTT ATT CTT GCT CTT CTC TCG TGT CTG ACC GTT CCG GCC TCT 
lie pH lie Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 53 60 

„. c ~ XAC CGA AAT GCC TCT C-GG ATT TAT CAT GTT ACC AAT GAT 

Gww — i" 1 - , Tl 3i s val Thr Asn Asp 

Ala val. Pro Tyr Arg Asn Ala Ser Gxy He Tyr ~ va_ 

65 70 75 

TGC C-A AAC TCT TCC ATA GTC TAT GAG GCA GAT AAC CTG ATC CTA CAC 

Cys Pro" £n Ser Ser He Val Tyr Glu Ala Asp Asn Leu He Leu rfx, 
85 90 

GCA CCT GGT TGC GTG CCT TGT GTC ATG ACA GGT AAT GTG AGT AGA TGC 
S Pro Z Cys Val Pro Cys Val Mec Thr Gly Asn Val Ser Arg Cys 
ioo 105 1 

TGG GTC CAA ATT ACC CCT ACA CTG TCA GCC CCG AGC CTC GGA GCA GTC 38- 
TrD Val Gin He Thr Pro Thr Leu Ser Ala Pro Ser Leu G.y rt ,a Va. 
115 120 

*C GCr CCT" c-T CGG AGA GCC GTT GAC TAC CTA GCG GGA GGG GCT GCC 4 3 

Thr Ala Pro Leu Arg Arg Ala Val Asp Tyr Leu Ala Gly Gly Axa .la 



238 



33c 



480 



523 



576 



579 
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Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu 
20 25 30 

Glu Asp Gly Val Asn Tyr Pro Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 



Ser He Phe He Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala 



50 



Ser 



60 



Ala Val Pro Tyr Arg Asn Ala Ser Gly He Tyr His Val Thr Asn Asp 
65 7 ° 75 go 

Cys Pro Asn Ser Ser He Val Tyr Glu Ala Asp Asn Leu He Leu His 
85 90 95 

Ala Pro Gly Cys Val Pro Cys Val Met Thr Gly Asn Val Ser Arg Cys 
100 105 no 

Trp Val Gin He Thr Pro Thr Leu Ser Ala Pre Ser Leu Gly Ala Val 
115 120 125 

Thr Ala Pro Leu Arg Arg Ala Val As? Tyr Leu Ala Glv Gly Ala Ala 
130 135 140 

Leu Cys Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe Leu 
145 150 155 " iso 

Val Gly Gin Met Phe Thr Tyr Arg Pro Arg Gin His Ala Thr Val Gin 
165 170 175 

Asn Cys Asn Cys Ser He Tyr Ser Gly His Val Thr Gly His Arg Met 
130 135 * 190 



Ala 



(2) INFORMATION FOR SEQ ID NO: 15 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
{iii) HYPOTHETICAL: NO 
(iii) ANT I- SENSE: NO 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(3) LOCATION: 1 . .5 79 

(ix) FEATURE: 

(A) NAME / KEY : mat_peptide 

(B) LOCATION: 1 . . 576 
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(xi) 

ACG TGC 
Thr Cys 
1 

CCC GTT 
Pro Val 



GAG GAC 
Glu Asp 



TCT ATC 
Ser He 
50 



SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

GGA TTC GCC GAC CTC GTG GGG TAC ATC CCG 
Glv Phe Ala Asp Leu Val Gly Tyr He Pro 
5 10 

GGG GGC GTC GCA AGG GCT CTC GCA CAT GGT 
Gly Gly Val Ala Arg Ala Leu Ala His Gly 
20 25 

GGG GTG AAT TAT GCA ACA GGG AAT CTG CCT 
Glv Val Asn Tyr Ala Thr Gly Asn Leu Pro 

35 40 

TTC ATT CTT GCA CTT CTC TCG TGC CTC ACT 
Phe He Leu Ala Leu Leu Ser Cys Leu Thr 
55 60 



CTC GTA GGC GGC 
Leu Val Gly Gly 
15 

GTG AGG GTT CTT 
Val Arg Val Leu 
30 

GGT TGC TCT TTC 
Gly Cys Ser Phe 
45 

GTC CCG GCC TCT 
Val Pro Ala Ser 



GCA GTT 
Ala Val 
65 

TGC CCA 
Cvs Pro 



GCA CCT 
Ala Pro 



TGG GTC 
Trt> Val 



CCC TAC CGA AAT GCC TCT GGG ATC TAT CAT 
Pro Tyr Arg Asn Ala Ser Gly He Tyr His 
7C 73 

AAC TCT TCC ATA GTC TAT GAG GCA GAT GAT 
Asr Ser S er He Val Tyr Glu Ala As? As? 
35 90 

GGC TGC GTG CCT TGT GTC AGG AAA GAT AAT 
Gly Cvs Val Pro Cys Val Arg Lys As? Asn 

100 105 

CAA A~^ ACC CCC ACG CTG TCA GCC CCG AGC 
Gin He Thr Pro Thr Leu Ser Ala Pro Ser 
115 120 



GTC ACC 
Val Thr 



CTG ATC 
Leu He 



GTG AGT 
Val Ser 
110 



AAT GAT 
Asn As? 

8 0 

CTA CAC 
Leu His 
95 

AGG TGC 
Arg Cys 



ACG GCT 
Thr Ala 
130 

CTC TGC 
Leu Cys 
145 

GTA GGC 
Val Gly 



GAC TGC 
Asp Cys 



CCC C^T CGG AGA GCC GTT GAT TAC TTG GTG 
Pro Leu Arg Arg Ala Val As? Tyr Leu Val 
135 140 

T C- GCG TTA TAC GTT GGA GAC GCG TGT GGG 
Se- Ala Leu Tyr Val Gly As? Ala Cys Giy 
150 155 

ic,G CCT CGC CAG CAT 
CAA ATG TTC AC. T^ ^ ^ 

Gin Met Phe Thr Tyr Arg Pro Arg Gin H-s 
165 170 

AAC TGT TCC ATC TAC AGT GGC CAC GTC ACC 
Asn Cvs Ser He Tyr Ser Gly His Va_ T.r 
18 0 135 



TTC GGA 
phe Gly 
125 

GGA GGG 

Gly Gly 



GCA GTC 
Ala val 



GCT GCC 
Ala Ala 



GCA CTA TTT TTG 
Ala Leu Phe Leu 
1S0 

GCT ACG GTG CAG 
Ala Thr Val Gin 
175 

GGC CAT CAG ATG 
Gly His Gin Me- 
190 



43 



96 



144 



192 



240 



23a 



336 



384 



4 3^ 



4S0 



528 



576 



579 



GCA 
Ala 



(2) INFORMATION FOR SEQ ID NO: 156: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

Thr Cys Gly Phe Ala Asp Leu Vai Gly Tyr lie Pro Leu Val Glv Gly 
1 3 10 is 

Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu 
20 25 30 

Glu Asp Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

Ser He Phe He Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 55 so 

Ala Val Pro Tyr Arg Asn Ala Ser Gly He Tyr His Vai Thr Asn Asc 
65 7 0 75 ac 

Cys Pro Asn Ser Ser He Val Tyr Glu Ala Asp Asp Leu lie Leu Kis 
85 90 95 

Ala Pro Gly Cys Val Pro Cys Val Arg Lys Asp Asn Vai Ser Arg Cys 
100 105 no 

Trp Vai Gin He Thr Pro Thr Leu Ser Ala Pro Ser Phe Gly Ala Val 
115^ 120 125 

Thr Ala Pro Leu Arg Arg Ala Val Asp Tyr Leu Val Gly Giy Ala Ala 
130 135 140 

Leu Cys Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe Leu 
145 150 155 150 

Val Giy Gin Met Phe Thr Tyr Arg Pro Arg Gin His Ala Thr Val Gin 
165 170 175 

Asp Cys Asn Cys Ser He Tyr Ser Gly His Val Thr Gly His Gin Mec 
180 185 190 

Ala 



{2} INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 30 base pairs 
(3) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 
(iii) ANT I -SENSE : NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(3) LOCATION: 3 . . 53C 

(ix) FEATURE: 

(A) NAME/KEY: maz^peptide 

(B) LOCATION: 3.. 527 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

CA CCT ACS ACA GCT CTG CTG GTG GCC CAG TTA CTG CGG ATT CCC CAA * 4 7 

Pro Thr Thr Ala Leu Leu Val Ala Gin Leu Leu Arg lie Pro Gin 
1 5 10 15 

GTG GTC ATT GAC ATC ATC GCA GGG AGC CAC TGG GGG GTC TTG TTT GCC 95 
Val Val He Asp He He Ala Gly Ser His Trp Giy Val Leu Phe Ala 
20 25 30 

GCC GCA TAC TAT GCA TCG GTG GCT AAC TGG ACC AAG GTC GTG CTG GTC 14 3 

Ala Ala Tyr Tyr Ala Ser Val Ala Asn Trp Thr Lys Val Val Leu Val 
35 40 45 

TTG TTT CTG TTT GCA GGG GTT GAT GCT ACT ACC CAG ATT TCG GGC GGC 191 
Leu Phe Leu Phe Ala Gly Val Asp Ala Thr Thr Gin He Ser Gly Gly 
50 55 60 

TCC AGC GCC CAA ACG ACG TAT GGC ATC GCC TCA TTT ATC ACC CGC GGC 23 9 

Ser Ser Ala Gin Thr Thr Tyr Gly He Ala Ser Phe He Thr Arg Gly 
65 70 75 

GCG CAG CAG AAA CTG CAG CTC ATA AAT ACC AAC GGA AGC TGG CAC ATC 29 7 

Ala Gin Gin Lys Leu Gin Leu He Asn Thr Asn Gly Ser Trp His He 
80 85 90 95 

AAC AGG ACC GCC CTT AAT TGT AAT GAC AGC CTC CAG ACT GGG TTC ATA 3 35 

Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe He 
100 105 110 

GCC GGC CTC TTC TAC TAC CAT AAG TTC AAC TCT TCT GGA TGC CCG GAT 383 
Ala Gly Leu Phe Tyr Tyr His Lys Phe Asn Ser Ser Gly Cys Pro Asp 
115 120 125 

CGG ATG GCT AGC TGT AGG GCC CTT GCC ACT TTT GAC CAG GGC TGG GGA 431 
Arg Met Ala Ser Cys Arg Ala Leu Ala Thr Phe Asp Gin Gly Trp Giy 
130 135 140 

ACT ATC AGC TAT GCC AAC ATA TCG ~GGT CCC ACT GAT GAC AAA CCA TAT 479 
Thr lie Ser Tyr Ala Asn He Ser Gly Pro Ser Asp Asp Lys Pro Tyr 
145 150 155 

TGC TGG CAC TAT CCC CCA CGG CCG TGC GGA GTG GTG CCA GCC CAA GAG 52 7 
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Cys Trp His Tyr Pro Pro Arg Pro Cys Gly Val Val Pro Ala Gin Glu 
160 16S 170 175 

GTC 530 
Val 



(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 176 amino acids 

(B) TYPE: amine acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 158: 

Pro Thr Thr Ala Leu Leu Val Ala Gin Leu Leu Arg lie Pro Gin Val 
15 10 15 

Val He Asp He He Ala Gly Ser His Trp Gly Val Leu Phe Ala Ala 
20 25 30 

Ala Tyr Tyr Ala Ser Val Ala Asn Trp Thr Lys Val Val Leu Val Leu 
35 40 45 

Phe Leu Phe Ala Gly Val Asp Ala Thr Thr Gin He Ser Gly Gly Ser 
50 55 60 

Ser Ala Gin Thr Thr Tyr Gly He Ala Ser Phe He Thr Arg Gly Ala 
65 70 75 80 

Gin Gin Lys Leu Gin Leu He Asn Thr Asn Gly Ser Trp Kis He Asn 
85 90 95 

Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe He Ala 
100 105 110 

Gly Leu Phe Tyr Tyr His Lys Phe Asn Ser Ser Gly Cys Pro Asp Arg 
115 120 125 

Met Ala Ser Cys Arg Ala Leu Ala Thr Phe Asp Gin Gly Trp Gly Thr 
130 135 140 

He Ser Tyr Ala Asn He Ser Gly Pro Ser Asp Asp Lys Pro Tyr Cys 
145 150 155 160 

Trp His Tyr Pro Pro Arg Pro Cys Gly Val Val Pro Ala Gin Glu Val 
165 170 175 



(2) INFORMATION FOR SEQ ID NO: 15 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(3) LOCATION: 2.. 340 

(ix) FEATURE : 

(A) NAME /KEY : ma c_pep tide 

(B) LOCATION: 2.. 337 



(xi) SEQUENCE DESCRIPTION: ScQ ID NO : 15 3: 

C TCG ACC GTT ACC GAA CAT GAC ATA ATG ACC GAA GAG TCC ATT TAC 4 6 

Ser Thr Val Thr Glu His Asp lie Mec Thr Glu Glu Ser lie Tyr 
15 10 15 

CAA TCA TGT GAC TTG CAG CCC GAG GCA CGC GCA GCA ATA CGG TCA CTC 94 
Gin Ser Cys Asp Leu Gin Pro Glu Ala Arg Ala Ala lie Arg Ser Leu 
20 25 30 

ACC CAA CGC CTC TAC TGT GGA GGC CCC ATG TAC AAC AGC AAG GGG CAA 14 2 

Thr Gin Arg Leu Tyr Cys Giy Gly Pro Met Tyr Asn Ser Lys Giy Gin 
35 40 45 

CAG TGT GGT TAT CGC AGA TGC CGC GCC AGC GGC GTT TTC ACC ACC AGT 190 
Gin Cys Gly Tyr Arg Arg Cys Arg Ala Ser Giy Val Phe Thr Thr Ser 
50 55 60 

ATG GGC AAC ACC ATG ACG TGC TAC ATC AAG GCT TTA GCC TCC TGT AGA 23 3 

Mec Gly Asn Thr Mec Thr Cys Tyr He Lys Ala Leu Ala Ser Cys Arg 
65 70 75 

GCC GCA AGG CTC CGG GAC TGC ACG CTC CTG GTG TGT GGT GAC GAT CTT 2 86 

Ala Ala Arg Leu Arg Asp Cys Thr Leu Leu Val Cys Gly Asp Asp Leu 
80 85 90 95 

GTG GCC ATC TGC GAG AGC CAG GGG ACA CAC GAG GAT GAA GCA AGC CTG 3 34 

Val Ala He Cys Glu Ser Gin Gly Thr His Glu Asp Glu Ala Ser Leu 
100 105 110 

AGA GCC 34 0 

Arg Ala 



(2) INFORMATION FOR 

(i) SEQUENCE 
(A) LENGTr 
(3) TYPE: 



SEQ ID NO: 16 0: 

CHARACTERISTICS : 

113 amino acids 
amino acid 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 
Ser Thr Val Thr Glu His Asp He Met Thr Glu Glu Ser He Tvr Gin 

1 5 io : 3 

Ser Cys Asp Leu Gin Pro Glu Ala Arg Ala Ala He Arg Ser Leu Thr 
20 25 30 

Gin Arg Leu Tyr Cys Gly Gly Pro Met Tyr Asn Ser Lys Glv Gin Gin 
35 40 45 

Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Phe Thr Thr Ser Met 
50 55 so 

Gly Asn Thr Met Thr Cys Tyr He Lys Ala Leu Ala Ser Cys Arg Ala 
65 70 75 80 

Ala Arg Leu Arg Asp Cys Thr Leu Leu Val Cys Gly Asp Asn Leu Val 
95 90 * " " 95 

Ala He Cys Glu Ser Gin Gly Thr His Glu Asp Glu Ala Ser Leu Arcr 
100 105 * no 

Ala 



(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 . .340 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(3) LOCATION: 2 . . 33 7 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

C TCA ACC GCC ACC GAA CAT GAC ATA TTG ACT GAA GAG TCC ATA TAC 4 6 

Ser Thr Ala Thr Glu His Asp He Leu Thr Glu Glu Ser He Tyr 
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15 10 15 

CAA TCA TGT GAC TCG CAG CCC GAC GCA CGC GCA GCA ATA CGG TCA CTC 94 
Gin Ser Cys Asp Ser Gin Pro Asp Ala Arg Ala Ala He Arg Ser Leu 
20 25 30 

■ACC CAA CGC TTG TTC TGT GGA GGC CCC ATG TAT AAC AGC AAG GGG CAA 142 
Thr Gin Arg Leu Phe Cys Gly Gly Pro Met Tyr Asn Ser Lys Gly Gin 
35 40 45 

CAA TGT GGT TAT CGC AGA TGC CGC GCC AGC GGC GTC TTC ACC ACC AGT 190 
Gin Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Phe Thr Thr Ser 
50 55 60 

ATG GGC AAC ACC ATG ACG TGC TAC ATT AAG GCT TTA GCC TCC TGT AGA 238 
Met Gly Asn Thr Met Thr Cys Tyr He Lys Ala Leu Ala Ser Cys Arg 
65 7Q 75 

ACC GCT GGG CTC CGG GAC TAC ACG CTC CTG GTG TGT GGT GAC GAT CAT 286 
Thr Ala Gly Leu Arg Asp Tyr Thr Leu Leu Val Cys Gly Asp Asp His 
80 85 90 95 

GTG GCC ATC TGC GAG AGC CAG GGG ACA CAC GAG GAT GAA GCG AAC CTG 3 34 

Val Ala He Cys Glu Ser Gin Gly Thr Kis Glu Asp Glu Ala Asn Leu 
100 105 110 

AGA GCC 34 0 

Arg Ala 



(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 113 amino acids 
(3) TYPE : amine acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

Ser Thr Ala Thr Glu His Asp He Leu Thr Glu Glu Ser He Tyr Gin 
15 10 15 

Ser Cys Asp Ser Gin Pro As? Ala Arg Ala Ala He Arg Ser Leu Thr 

20 25 30 

Gin Arg Leu Phe Cys Gly Gly Pro Met Tyr Asn Ser Lys Gly Gin Gin 
35 40 45 

Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Phe Thr Thr Ser Met 

50 55 60 

Gly Asn Thr Met Thr Cys Tyr He- Lys Ala Leu Ala Ser Cys Arg Thr 
65 70 75 80 

Ala Gly Leu Arg Asp Tyr Thr Leu Leu Val Cys Gly Asp Asp His Val 
85 90 95 
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Ala lie Cys Glu Ser Gin Gly Thr His Glu Asp Glu Ala Asn Leu Arg 
100 105 no 

Ala 



(2) INFORMATION FOR SEQ ID NC : 153: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 99 base pairs 
(5) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 



*YS (ix) FEATURE 

la* 



(A) NAME/KEY: COS 
(E ) LOCATION: I. .4 99 



*JH (ix) FEATURE: 

(A) NAME / KEY : mat_pepcide 

(B) LOCATION: 1 . .4 96 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 153: 



• SSKF 



jjj ATG AGC ACG AAT CCT AAA CTT CAA AGA AAA ACC AAA CGT AAC ACC AAC 4 3 

Met Ser Thr Asn Pro Lys Leu Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 



CGC CGC CCC ATG GAC GTT AAG TTC CCG GGT GGT GGC CAG ATC GTT GGC 95 
Arg Arg Pro Met Asp Vai Lys Phe Pro Gly Gly Gly Gin lie Vai Gly 
20 25 30 

GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGG TTG GGT GTG CGC GCG 14 4 

Gly Vai Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Vai Arg Ala 
35 40 45 

ACT CGG AAG ACT TCG GAG CGG TCG CAA CCT CGT GGG AGG CGC CAA CCT 19 2 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

ATC CCC AAG GCG CGC CGA TCC GAG GGC AGA TCC TGG GCG CAG CCC GGG 24 0 

lie Pro Lys Ala Arg Arg Ser Glu Gly Arg Ser Trp Ala Gin Pro Gly 
65 70 75 80 

TAT CCT TGG CCC CTT TAC GGC AAT ~GAG GGC TGT GGG TGG GCA GGG TGG 29 3 

Tvr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 
85 90 95 

CTC CTG TCC CCT CGC GGG TCT CGG CCG TCT TGG GGC CCT AAT GAT CCC 3 35 
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Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 
100 105 110 

CGG CGG AGG TCC CGC AAC CTG GGT AAG GTC ATC GAT ACC CTA ACA TGC 384 
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val lie Asp Thr Leu Thr Cys 
115 120 125 

GGC TTC GCC GAC CTC AT 3 GGA TAC ATC CCG CTT GTA GGC GCC CCC GTG 432 
Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala Pro Val 
130 135 140 

GGT GGC GTC GCC AGA GCC CTG GCA CAC GGT GTT AGG GCT GTG GAA GAC 4 30 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val Glu Asp 
145 150 155 160 

GGG ATC AAC TAC GCA ACA G 4 99 

Gly He Asn Tyr Ala Thr 
165 



(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 166 amino acids 
(3) TYPE: amine acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: procein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

Met Ser Thr Asn Pro Lys Leu Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

Arg Arg Pro Met Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

He Pro Lys Ala Arg Arg Ser Glu Gly Arg Ser Trp Ala Gin Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 
100 105 110 

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 
115 120 - 125 

Gly Phe Ala As? Leu Met Gly Tyr He Pro Leu Val Gly Ala Pro Val 
130 135 140 
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Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val Glu Asd 

145 150 155 1S q 

Gly lie Asn Tyr Ala Thr 
165 



(2) INFORMATION FOR SEQ ID NC : 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 499 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
<iii) HYPOTHETICAL : NO 
(iii) ANTI-SENSE: NO 



240 
300 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 
ATGAGCACGA ATCCTAAACC TCAAAGAAAA ACCAAAC3TA ACACCAACCG CCGCCCTATG SO 
GACGTTAAGT TCCCAGGCGG TGGTCAGATC GTTGGC3GAG TTTACTTGTT GCCGCGCAGG 12 0 

GGCCCCAGGT TGGGTGTGCG CGCGACTCGG AAGACTTCGG AGCGGTCGCA ACCTCGTGGG . ISO 
AGGCGCCAAC CTA7CCCCAA GGCGCGCCGA ACCGAGGGCA GATCCTGGGC GCAGCCCGGG 
TATCCTTGGC CCCTTTACGG CAATGAGGGC TGTGGGTGGG CAGGGTGGCT CCTGTCCCCT 
CGCGGNTCTC GGNCGTCTTG GGGCCCCAAT GATCCCCGGN GGAGATCCCG CAACTTGGGT 35 0 

AAGGTCATCG ATACCCTAAC ATGCGGCTTC GCCGACCTCA TGGGATACAT CCCGCTTGTA 42 0 

GGCGCCCCCG TGGGTGGCGT CGCCAGGGCC CTGGCACATG GTGTTAGGGC TGTGGAAGAC 48 0 

GGGATCAATT ATGCAACAG 

(2) INFORMATION FOR SEQ ID NO: 16c: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 



499 
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15 10 15 

Arg Arg Pro Met: Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

lie Pro Lys Ala Arg Arg Thr Glu Gly Arg Ser Trp Ala Gin Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Xaa Ser Arg Xaa Ser Trp Gly Pro Asn Asp Pro- 
100 105 110 

Arg Xaa Arg Ser Arg Asn Leu Gly Lys Val lie Asp Thr Leu 

115 120 12S 

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANT I- SENSE: NO 



FEATURE : 
(A) NAME / KEY : 
(3) LOCATION: 

FEATURE : 

(A) NAME / KEY : 
(3) LOCATION: 



CDS 

1. .579 



mat_peptide 
1. .579 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

ACA TGC GGC TTC GCC GAC CTC ATG GGA TAC ATC CCG CTT GTA GGC GCC 4 8 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala 
15 10 15 

CCC GTG GGT GGC GTC GCC AGG GCC'CTG GCA CAT GGT GTT AGG GCT GTG 96 
Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
20 25 30 

GAA GAC GGG ATC AAT TAT GCA ACA GGG AAC CTT CCC GGT TGC TCC TTT 144 
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35 




Asn Tyr Ala 



Thr 
40 
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Gly Asn Leu Pro 



PCT/EP94/01323 



Gly Cys Ser Phe 
45 



TCT ATC TTC CTC TTG GCG CTC CTC TCG TGC CTG ACT 
Ser He Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr 
50 55 go 



GTT CCC ACA TCG 
Val Pro Thr Ser 



19: 



GCC GTT AAC TAT CGC AAT GCT TCG GGC ATT 
Ala Val Asn Tyr Arg Asn Ala Ser Gly He 
65 70 

TGC CCG AAT GCA AGC ATA GTG TAC GAG ACC 
Cys Pro Asn Ala Ser He Val Tyr Glu Thr 
95 90 

CTC CCA GGG TGC GTA CCC TGT GTG AGG ACT 
Leu Pro Gly Cys Val Pro Cys Val Arg Thr 
100 105 

TGG GTG GCC CTC ACT CCC ACA GTA GCG TCG 
Trp Val Ala Leu Thr Pro Thr Val Ala Ser 

115 no 



TAT CAC 
Tyr His 
75 

GAA AAT 
Glu Asn 



GGG AAC 
Gly Asn 



CCA TAC 
Pro Tyr 



ATC ACC 
He Thr 



CAC ATC 
His He 



CAG TCG 
Gin Ser 
110 

GCC GGT 
Ala Gly 
125 



AAT GAC 
Asn Asp 
80 

TTA CAC 
Leu His 
95 

CGG TGT 
Arg Cys 



GCT CCG 
Ala Pro 



240 



288 



336 



334 



CTT GAG CCC TTG CGG CGT CAT 
Leu Glu Pro Leu Arg Arg His 
130 135 



GTG GAC CTG ATG GTA GGT GCT GCC ACC 
Val Asp Leu Men Val Gly Ala Ala Thr 
140 



432 



m 



ATG TGT TCC GCC 
Met Cys Ser Ala 
145 

GTG GGC CAA ATG 
Val Gly Gin Met 



CTC 
Leu 



TTC 
Phe 
165 



TAC 
Tyr 
150 

ACC 
Thr 



ATC 
lie 



Phs 



GGC GAC TTG TGC GGT 
Gly Asp Leu Cys Gly 
155 

CAA CCG CGA CGT CAC 
Gin Pro Arg Arg His 
170 



GGC TTA TTC TTG 
Gly Leu Phe Leu 
160 

TGG ACC ACT CAG 
Trp Thr Thr Gin 
175 



480 



528 



GAC TGC AAT TGT TCC 
Asp Cys Asn Cys Ser 
180 



VTC TAC ACG GGC CAC ATT ACG 
:ie Tyr Thr Gly His He Thr 
135 



GGT CAT CGG ATG 
Gly His Arg Met 
190 



576 



GCA 
Ala 



579 



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 163: 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala 
15 10 15 
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Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
20 25 30 

Glu Asp Gly lie Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

Ser lie Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 

50 35 60 

Ala Val Asn Tyr Arg Asn Ala Ser Gly lie Tyr His lie Thr Asn Asp 
65 70 75 80 

Cys Pro Asn Ala Ser lie Val Tyr Glu Thr Glu Asn His lie Leu His 
85 90 95 

Leu Pro Gly Cys Val Pro Cys Val Arg Thr Gly Asn Gin Ser Arg Cys 
100 105 HO 

Trp Val Ala Leu Thr Pro Thr Val Ala Ser Pro Tyr Ala Gly Ala Pro 
115 120 125 

Leu Glu Pro Leu Arg Arg His Val Asp Leu Met Val Gly Ala Ala Thr 
130 135 140 

Met Cys Ser Ala Leu Tyr lie Gly Asp Leu Cys Gly Gly Leu Phe Leu 
145 150 155 ISO 

Val Gly Gin Met Phe Thr Phe Gin Pro Arg Arg His Trp Thr Thr Gin 
165 170 175 

Asp Cys Asn Cys Ser lie Tyr Thr Gly His He Thr Gly His Arg Met 
130 135 190 



Ala 



(2) INFORMATION FOR SEQ ID NO: 15 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . . 579 

(ix) FEATURE : 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1. .576 



SUBSTITUTE SHEET (RULE 26) 



WO 94/25601 




PCT/EP94/01323 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 9: 

ACA TGC GGC TTC GCC GAC CTC ATG GGA TAC ATC CCG CTT GTA GGC GCC 
Thr Cys Gly Phe Ala Asp Leu Met: Gly Tyr lie Pro Leu Val Gly Ala 
15 io X s 



GGC G7T AAC TAT CGC AAT GCT TCG GGC GTT TAT CAC ATC ACC AAC GAC 
Q Gly Val Asn Tyr Arg Asn Ala Ser Gly Val Tyr His He Thr Asn Asr; 

•fi 65 7 0 75 8 0 



48 



CCC GTG GGT GGC GTC GCC AGA GCC CTG GCA CAC GGT GTT AGG GCT GTG 96 

Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 

20 25 3Q 

GAA GAC GGG ATC AAC TAC GCA ACA GGG AAT CTC CCC GGT TGC TCC TTT 144 

Glu Asp Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 

35 40 45 

TCT ATC TTC CTC TTG GCA CTT CTC TCG TGC CTC ACT GTT CCC GCG TCG 192 

Ser He Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 

50 55 " 60 



240 



288 



336 



03 

\J TGC CCG AAT GCG AGC ATA GTG TAC GAG ACC GAC AAT CAC ATC TTA CAC 

bj c, / s ?ro Asn Ala Ser He Val Tyr Glu Thr Asp Asn His He Leu His 

fU 85 90 95 

ft i 

'Z CCA GGG TGC GTA CCC TGT GTG AAG ACC GGG AAC CAG TCG CGG TGT 

+ " Leu Pro Gly Cys Val Pro Cys Val Lys Thr Gly Asn Gin Ser Arg Cys 

* 100 105 110 

8} TGG GTG GCC CTC ACT CCC ACA GTG GCG TCG CCT TAC GTC GGT GCT CCG 3 84 

Q Trp Val Ala Leu Thr Pro Thr Val Ala Ser Pro Tyr Val Gly Ala Pro 

111 115 120 * 125 

j«& CTC GAG C <2C TTG CGG CGC CAT GTG GAC CTG ATG GTA GGT GCT GCC ACC 43 2 

Leu Glu Pro Leu Arg Arg His Val Asp Leu Me- Val Gly Ala Ala Thr 

130 135 140 

GTG TGC TCC GCC CTC TAC GTC GGC GAC CTG TGC GGT GGC TTA TTC TTG- 48 0 

Val Cys Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Leu Phe Leu * 
14 = 150 155 160 

GTA GGC CAA ATG TTC ACC TTC CAA CCG CGA CGC CAC TGG ACG ACC CAG 52 3 

Val Gly Gin Met Phe Thr Phe Gin Pro Arg Arg His Trp Thr Thr Gin 
155 170 175 

GAC TGT AAT TGT TCC ATC TAC GCA GGG CAT ATT ACG GGC CAT CGG ATG 5 76 

Asp Cys Asn Cys Ser lie Tyr Ala Gly His lie Thr Gly His Arg Met: 
180 185 190 

GCT 579 
Ala 



(2) INFORMATION FOR SEQ ID NO: 170: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 
(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NC : 170: 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Giy Ala 
1 S 10 15 

Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
20 25 30 

Glu Asp Gly lie Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

Ser He Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 55 60 

Gly Val Asn Tyr Arg Asn Ala Ser Gly Val Tyr His lie Thr Asn As? 
65 70 75 80 

Cys Pro Asn Ala Ser He Val Tyr Glu Thr Asp Asn His He Leu His 
85 90 95 

Leu Pro Gly Cys Val Pro Cys Val Lys Thr Gly Asn Gin Ser Arg Cys 
100 105 HO 

Trp Val Ala Leu Thr Pro Thr Val Ala Ser Pro Tyr Val Giy Ala Pro 
115 120 125 

Leu Glu Pro Leu Arg Arg His Val Asp Leu Met Val Gly Ala Ala Thr 
130 135 140 

Vai Cys Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Leu Phe Leu 
145 150 155 ISO 

Vai Glv Gin Met Phe Thr Phe Gin Pro Arg Arg His Trp Thr Thr Gin 
165 170 175 

Asd Cys Asn Cvs Ser He Tyr Ala Gly His He Thr Gly His Arg Met 
180 135 190 

Ala 



(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 9 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNES3 : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(3 ) LOCATION : 1. .579 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1. .576 



{Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

ACA TGC GGC TTC GCC GAC CTC ATG GGA TAC ATC CCG CTT GTG GGC GCC 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Va : G 1 v Ala 
15 10 Is 

CCT GTT GGT GGC GTC GCC AGA GCC CTT GCG CAC GGC GTC AGG GCT GTG 

Pro Val Gly Gly Val Ala Arg Ala Leu Ala Kis Gly Val Arc Ala Val 

20 , .25 30 



TCT ATC TTC CTT CTG GCA CTT CTC TCG TGC CTG ACT GTC CCC GCC TCG 

Ser He Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 55 GO 

GCT GTG CAT TAT CAC AAC ACC TCG GGC ATC TAC CAC CTC ACC AAT GAC 

Ala Val Kis Tyr His Asn Thr Ser Gly He Tyr His Leu Thr Asn Aso 

65 70 75 80 

TGC CCT AAC TCT AGC ATA GTC TTT GAG GCA GTC CAT CAC ATC TTG CAC 

Cys Pro Asn Ser Ser He Val Phe Glu Ala Val His His lie Leu His 
85 90 95 



48 



96 



GAA GAC GGG ATT AAC TAT GCA ACA GGG AAC CTT CCT GGT TGC TCC TTT 144 
Glu As? Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 



192 



240 



233 



CTT CCA GGA TGC GTC CCT TGT GTA AGA ACT GGG AAC CAG TCT CGG TGC 3 3 5 

Leu Pro Gly Cys Val Pro Cys Val Arg Thr Gly Asn Gin Ser Arg Cys 
100 105 no 

TGG GTA GCC TTG ACC CCC ACG CTG GCC GCG CCA TAC CTT GGC GCT CCA 3 84 

Trp Val Ala Leu Thr Pro Thr Leu Ala Ala Pro Tyr Leu Gly Ala Pro 
115 120 125 

CTC GAG TCC ATG CGG CGT CAC GTG GAT TTG ATG GTG GGC ACT GCT ACA 43 2 

Leu Glu Ser Met Arg Arg His Val Asp Leu Met: Val Gly Thr Ala Thr 
130 135 14Q 

TTG TGC TCA GCA CTC TAC GTT GGG GAC CTG TGC GGG GGC ATA TTC CTA 4 30 

Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly lie Phe Leu 
145 150 155 * * ISO 

GCG GGC CAG ATG TTC ACC TTC CGG CCC CGC CTC CAT TGG ACC ACC CAG 52 3 

Ala Gly Gin Mec Phe Thr Phe Arg Pro Arg Leu His Trp Thr Thr Gin 

165 170 175 
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GAG TGC AAT TGT TCC ACC TAT CCG GGC CAC ATC ACG GGT CAT AGA ATG 576 
Glu Cys Asn Cys Ser Thr Tyr Pro Gly His lie Thr Gly His Arg Met 
180 185 190 

GCG 579 
Ala 



(2) INFORMATION FOR SEQ ID NO: 172: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr lie Pre Leu Val Gly Ala 
1 ' 5 10 15 

Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
20 25 30 

Glu Asp Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

Ser He Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 55 60 

Ala Val His Tyr His Asn Thr Ser Gly He Tyr Kis Leu Thr Asn Asp 
65 . 70 75 SO 

Cys Pro Asn Ser Ser He Val Phe Glu Ala Val His His He Leu Kis 
85 90 95 

Leu Pro Gly Cys Val Pro Cys Val Arg Thr Gly Asn Gin Ser Arg Cys 
100 105 110 

Trp Val Ala Leu Thr Pro Thr Leu Ala Ala Pro Tyr Leu Gly Ala Pro 
115 120 125 

Leu Glu Ser Met Arg Arg His Val Asp Leu Met Val Gly Thr Ala Thr 
130 135 140 

Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly He Phe Leu 
145 150 155 ISO 

Ala Gly Gin Met Phe Thr Phe Arg Pro Arg Leu His Trp Thr Thr Gin 
165 170 175 

Glu Cys Asn Cys Ser Thr Tyr Pro Gly His lie Thr Gly His Arg Met 
ISO 135 190 



Ala 
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(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE3S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
{iii) ANTI-SENSE: NO 



m 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .579 

(ix) FEATURE : 

(A) NAME / KEY : mat_pe?: 

(B) LOCATION: 1 . . 575 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

ACQ TGC GGT TCC GCC GAC CTC ATG GGA TAG ATC CCG CTC GTA GGC GCC 4 3 

Thr Cys Gly Ser Ala Asp Leu Mec Gly Tyr lie Pro Leu Val Gly Ala 
15 10 15 

CCT GTG GGT GGC GTC GCC AGG GCC TTG GCG CAT GGC GTC AGG GCT GTG 96 
Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arc Ala Val 
20 25 30 

GAG GAC GGG ATA AAC TAT GCA ACA GGG AAC CTT CCT GGT TGC TCT TTT 144 
Glu Asp Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

TCT ATC TTC CTT CTG GCA CTT CTC TCG TGC CTG ACT GTC CCC GCC TCA 192 
Ser He Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 55 60 

GCT GTG CAT TAT CAC AAC ACC TCG GGC ATC TAT CAC ATC ACT AAT GAC 24 0 

Ala Val His Tyr His Asn Thr Ser Gly He Tyr His He Thr Asn Asp 
65 70 75 .80 

TGC CCT AAC TCT AGC ATA GTC TTT GAG GCA GAG CAT CAC ATC TTG CAT 288 
Cys Pro Asn Ser Ser He Val Phe Glu Ala Glu His His He Leu His 
85 90 95 

CTT CCA GGA TGC GTC CCC TGT GTG AGA ACT GGG AAC CAG TCA CGA TGC 3 36 

Leu Pro Gly Cys Val Pro Cys Val ~Arg Thr Gly Asn Gin Ser Arg Cys 
100 105 110 

TGG ATA GCC TTG ACC CCT ACG TTG GCC GCG CCA CAC ATT GGC GCT CCA 3 84 

Trp He Ala Leu Thr Pro Thr Leu Ala Ala Pro His He Gly Ala Pro 
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lis 



120 



125 



CTT GAG TCC ATG CGA CGT CAT GTG GAT TTG ATG GTA GGC ACT GCC ACA 
Leu Glu Ser Met Arg Arg His Val Asp Leu Met Val Gly Thr Ala Thr 
130 135 140 



432 



TTG TGC TCC GCA CTC TAC ATT GGA GAT CTG TGC GGA GGC ATA TTT CTA 
Leu Cys Ser Ala Leu Tyr He Gly Asn Lau Cvs Glv Gly He Phe Leu 
145 150 " 155 * 160 



480 



GTG GGC CAG ATG TTC AAC TTC AGG CCC CGC CTG CAC TGG ACC ACC CAG 
Val Gly Gin Met Phe Asn Phe Arg Pro Arg Leu His Trp Thr Thr Gin 
165 170 175 



528 



GAG TGC AAT TGT TCC ATC TAT CCA GGC CAC ATC ACG GGT CAC AGA ATG 
Glu Cys Asn Cys Ser He Tyr Pro Gly His Xle Thr Gly His Arg Met 
130 185 190 



576 



GCG 
Ala 



579 



(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amine acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 

Thr Cys Gly Ser Ala As? Leu Met Gly Tyr lie Pro Leu Val Gly Ala 
15 10 15 

Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
20 25 30 

Glu Asp Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

Ser He Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 55 so 

Ala Val His Tyr His Asn Thr Ser Gly He Tyr His He Thr Asn Asp 
65 70 75 "80 

Cys Pro Asn Ser Ser He Val Phe Glu Ala Glu His His He Leu His 
85 90 95 

Leu Pro Gly Cys Val Pro Cys Val Arg Thr Gly Asn Gin Ser Arg Cys 
100 -aos 110 

Trp He Ala Leu Thr Pro Thr Leu Ala Ala Pro His He Gly Ala Pro 
115 120 125 
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Leu Glu Ser Met Arg Arg His Val Asp Leu MeC Val Gly Thr Ala Thr 
130 13S 140 

Leu Cys Ser Ala Leu Tyr lie Gly Asp Leu Cys Gly Gly lie Phe Leu 
145 150 155 160 

Val Gly Gin Met: Phe Asn Phe Arg Pro Arg Leu His Trp Thr Thr Gin 
165 ' 170 175 

Glu Cys Asn Cys Ser lie Tyr Pro Gly His lie Thr Gly His Arg Met 
180 185 190 



Ala 



(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 79 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
{ iii) HYPOTHETICAL: NO 
(iii) ANT I- SENSE: NO 



FEATURE : 

(A) NAME /KEY : 

(B) LOCATION: 

FEATURE : 
(A) NAME /KEY : 
(3) LOCATION: 



CDS 

1. .579 



mat_peptide 
1. .576 



(xi) SEQUENCE DESCRj 

ACG TGC GGC TTT GCC GAC 
Thr Cys Gly Phe Ala Asp 
1 5 

CCT GTG GGT GGC GTC GCC 
Pro Val Gly Gly Val Ala 
20 

GAG GAC GGG ATT AAC TAT 
Glu Asp Gly He Asn Tyr 
35 

TCT ATC TTC CTT CTA GCA 
Ser He Phe Leu Leu Ala 
50 

GCG CAG CAC TAC CGG AAC 
Ala Gin His Tyr Arg Asn 



PTION: SEQ ID NO: 175: 

CTC ATG GGA TAC ATC CCG 
Leu Met Gly Tyr He Pro 
10 

AGG GCC TTG GCA CAT GGT 
Arg Ala Leu Ala His Gly 
25 

GCA ACA GGG AAT CTT CCC 
Ala Thr Gly Asn Leu Pro 
40 

CTT CTC TCG TGC TTG ACT 
Leu Leu - Ser Cys Leu Thr 
55 60 

ATC TCG GGC ATT TAT CAC 
He Ser Gly He Tyr His 



CTC GTG GGC GCC 4 3 

Leu Val Gly Ala 
15 

GTC AGG GCC GTG 96 
Val Arg Ala Val 
30 

GGT TGC TCC TTT 144 
Gly Cys Ser Phe 
45 

GTC CCG GCC TCG 192 
Val Pro Ala Ser 



GTC ACC AAT GAC 24 0 

Val Thr Asn Asp 
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65 70 75 80 



TGC CCG AAC TCT AGT ATA GTG TAT GAA GCT GAC CAT CAT ATC ATG CAT 288 
Cys Pro Asn Ser Ser He Val Tyr Glu Ala Asp His His He Met His 
85 90 95 

CTA CCA GGG TGT GTG CCT TGC GTG AGA ACC GGG AAC ACC TCG CGC TGC 336 
Leu Pro Gly Cys Val Pro Cys Val Arg Thr Gly Asn Thr Ser Arg Cys 
100 105 HO 

TGG GTT CCT TTA ACA CCC ACT GTG GCT GCC CCC TAT GTT GGC GCG CCG 3 84 

Trp Val Pro Leu Thr Pro Thr Val Ala Ala Pro Tyr Val Gly Ala Pro 
115 120 125 

CTC GAA TCC ATG CGG CGG CAC GTG GAC TTA ATG GTG GGT GCC GCC ACC 432 
Leu Glu Ser Met Arg Arg His Val Asp Leu Met Val Gly Ala Ala Thr 
130 135 140 

GTC TGC TCG GCC CTG TAC ATC GGA GAC CTT TGC GGA GGT GTC TTC CTG " 4a 0 

Val Cys Ser Ala Leu Tyr He Gly Asp Leu Cys Gly Gly Val Phe Leu 

rj 145 150 155 ISC 

fS GTC GGG CAG ATG TTC ACC TTC CGG CCG CGC CGC CAT TGG ACT ACC CAG 52 9 

I*j Val Gly Gin Met Phe Thr Phe Arg Pro Arg Arg His Trp Thr Thr Gir. 
Ul " 16 5 170 175 

Lr; GAC TGC AAC TGC TCT ATC TAT GAT GGC CAC ATC ACC GGC CAT AGA ATG 5 76 

Asp Cys Asn Cys Ser lie Tyr Asp Gly His lie Thr Gly His Arg Met 
4** ~ * 180 195 190 

□ GCT 579 

yi A^a 



(2) INFORMATION FOR SEQ ID NO : 176: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala 
1 5 10 15 * 

Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
20 25 30 

Glu Asp Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

Ser lie Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 55 60 
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Ala Gin His Tyr Arg Asn lie Ser Gly lie Tyr His Val Thx Asn Asp 
65 70 75 80 

Cys Pro Asn Ser Ser He Val Tyr Glu Ala Asp His His He Met His 

85 90 95 

Leu Pro Gly Cys Val Pro Cys Val Arg Thr Gly Asn Thr Ser Arg Cys 
100 105 110 

Trp Val Pro Leu Tar Pro Thr Val Ala Ala Pro Tyr Val Gly Ala Pro 
115 120 125 

Leu Glu Ser Met: Arg Arg His Val Asp Leu Met Val Gly Ala Ala Thr 
130 135 140 

Val Cys Ser Ala Leu Tyr He Gly Asp Leu Cys Gly Gly Val Phe Leu 
145 150 155 160 

Val Gly Gin Met Phe Thr Phe Arg Pro Arg Arg His Trp Thr Thr Gin 
165 170 175 

Asp Cys Asn Cys Ser He Tyr Asp Gly His He Thr Gly His Arg Met 
130 135 190 

Ala 



(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 



FEATURE: 
(A) NAME / KEY : 
(3) LOCATION: 

FEATURE : 

(A) NAME / KEY : 

(B) LOCATION: 



CDS 

1 . . 579 



mat _ peptide 
1. .576 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

ACG TGC GGG TTC GCC GAC CTC ATG GGA TAC ATC CCG CTC GTG GGC GCT 4 3 

Thr Cys Gly Phe Ala Asp Leu Met "Gly Tyr He Pro Leu Val Gly Ala 
1*5 10 15 

CCA GTA GGA GGC GTC GCC AGA GCC TTG GCG CAT GGC GTC AGG GCT GTG 96 
Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
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GAG GAC 
Glu Asp 



TCT ATC 
Ser He 
50 

GCT CAG 
Ala Gin 
65 

TGC CCG 
Cys Pro 



20 

GGG ATC 
Gly He 
35 

TTC CTC 
Phe Leu 



CAC TAC 

His Tyr 



AAT 
Asn 



TTG 

Leu 



CGG 
Arg 



AAC TCC 
Asn Ser 



AGT 
Ser 
85 

GTG 
Val 



TAC 
Tyr 



GTA 

Val 



AAT 
Asn 
70 

ATT 
He 
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25 

GCA ACA GGG AAC CTT 
Ala Thr Gly Asn Leu 
40 




PCT/EP94/01323 



Leu 
55 

GCA 
Ala 



GTG 
Val 



CTC TCG CGC CTA 
Leu Ser Arg Leu 



TCG GGC 
Ser Gly 



TAT GAA 
Tyr Glu 



CTA CCC GGG TGT 
Leu Pro Gly Cys 
100 

TGG ATT CCT TTA ACA 
Trp He Pro Leu Thr 
115 

CTT ACG TCT GTA 
Leu Thr Ser Val 
130 



CCC 
Pro 



CCC 
Pro 



Cvs 



ACT 
Thr 



CGG 
Ara 



TTA TGC TCT GCC 
Leu Cys Ser Ala 
145 

GCA GGG CAG ATG 
Ala Gly Gin Met 



GAT TGC AAC TGT 
Asp Cys Asn Cys 
180 

GCC 
Ala 



CTC 
Leu 



CAG 
Gin 



TAC 
Tyr 
150 



GT( 

Val 

165 

TCC 
Ser 



AGT 
Ser 



CAT 
His 
135 

ATC 
lie 



TTC 
Phe 



GTA AGA 
Val Arg 

105 

GTA GCC 
Val Ala 
120 

GTG GAC 
Val Aso 



ATC TAC 
He Tyr 
75 

GCC GAC 
Ala Asp 
90 

ACT GGG 
Thr Gly 



CCC 
Pro 



ACT 
Thr 
60 

CAT 

His 



30 

GGC TGC TCC TTT 
Gly Cys Ser Phe 
45 

GTC CCA GCG TCT 
Val Pro Ala Ser 



CAT 
His 



GTC ACC 
Val Thr 



CAC ATC 
His He 



GTC CCC 
Val Pro 



AAT 
Asn 



TAC 
Tvr 



CTG ATG 
Leu Met 



GTG 
Val 
140 



GTC TCG 
Val Ser 
110 

CTC GGG 
Leu Gly 
125 

GGG GCG 
Giv Ala 



AAC GAC 
Asn Asp 
80 

ATG CAC 
Met His 
95 

CGT TGC 
Arg Cys 



GCT CCA 
Ala Pro 



GCC ACC 
Ala Thr 



GGA GAC CAT TGC GGA 
Gly As? His Cys Gly 
155 

CAA CCC CGG CGT CAT 
Gin Pro Arg Arg His 
170 



ATC TAT GTG GGC CAC ATC ACC 
lie Tyr Val Gly His He Thr 
135 



GGT GTC TTC TTG 
Gly Val Phe Leu 
ISO 

TGG ACT ACC CAG 
Trp Thr Thr Gin 
175 

GGC CAC AGG ATG 
Gly His Arg Met 
190 



144 



192 



240 



283 



336 



384 



432 



480 



523 



576 



579 



(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH -. 193 amino acids 

(B) TYPE: amiv.o acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 



SUBSTITUTE SHEET (RULE 26) 



WO 94/25601 




228 




PCT/EP94/01323 



Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala 

15 10 x5 

Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 

20 25 * 30 

Glu Asp Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 

35 40 45 

Ser He Phe Leu Leu Val Leu Leu Ser Arg Leu Thr Val Pro Ala Ser 

50 55 60 

Ala Gin His Tyr Arg Asn Ala Ser Gly He Tyr His Val Thr Asn Asp 

65 70 75 80 

Cys Pro Asn Ser Ser He Val Tyr Glu Ala Asp His His He Met: His 

85 90 95 



pa* 



Leu Pro Gly Cys Val Pro Cys Val Arg Thr Gly Asn Val Ser Arg Cys 
100 105 HO 



Trp He Pro Leu Thr Pro Thr Val Ala Val Pro Tyr Leu Gly Ala Pro 
115 120 125 



Leu Thr Ser Val Arg Gin His Val Asp Leu Met Val Gly Ala Ala Thr 
130 135 140 



Leu Cys Ser Ala Leu Tyr He Gly Asp His Cys Gly Gly Val Phe Leu 

145 150 155 160 

Ala Gly Gin Met Val Ser Phe Gin Pro Arg Arg His Trp Thr Thr Gin 

165 170 175 



1*1 



Asp Cys Asn Cys Ser He Tyr Val Gly His He Thr Gly His Arg Met 
180 185 190 



Ala 



(2) INFORMATION FOR SEQ ID NO : 17 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES3 : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1. . 579 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 
ACCTGCGGCT TCGCCGACCT CATGGGATAC ATCCCGCTCG TAGGCGCCCC CGTGGGAGGC 60 
GTC3CCAGAR CTCTGGCGCA TG-GCGTCAGG GCTCTGGAAG ACGGGATCAA TTATGCAACA 120 
GGGAATCTTC CTGGTTGCTC TTTCTCTATC TCCCTTCTTG AACTTCTCTC GTGCCTGACT 
GTTCCCGCCT CAGCCATCCA CTA7C3CAAT GCTTCGGACG GTTATTATAT CACCAATGAT 
TGCCCGAACT CTAGCATAGT GTATGAAGCC GAGAACCACA TCTTGCACCT TCCGGGGTGT 300 
ATACCCTGTG TGAAGACCGG GAATCAGTC3 CGGTGCTGGG TGGCTCTCAC CCCCACGCTG 
GCGGCCCCAC ACCTACGTGC TCCGCTTTCG TCCTTACGGG CGCATGTGGA CCTAATGGTG 
GGGGCCGCCA CGGCATGCTC CGCTTTTTAC ATTGGAGATC TGTGCGGGGG TGTGTTTTTG 
GCGGGCCAAC TGTTCACTAT CCGGCCACGC ATTCA7GAAA CCACTCAGGA CTGCAATTGC 
TCCATCTACT CAGGG CACAT CAC3GGTNNN NNNNNNNNN 



180 
240 



360 
420 
430 
540 
579 



(2) INFORMATION FOR SEQ ID NO : 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amine acids 

(B) TYPE: amino acid 

(C) STSANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Leu Val Gly Ala 
! 5 1° 15 

Pro Val Gly Gly Val Ala Arc Xaa Leu Ala His Gly Val Arg Ala Leu 
20 25 30 



Giu Asd Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 4S 

Ser He Ser Leu Leu Glu Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 55 60 

Ala lie His Tyr Arg Asn Ala Ser As? Gly Tyr Tyr lie Thr Asn Asp 



65 



70 75 80 



Cys Pro Asn Ser Ser lie Val- Tyr Glu Ala Glu Asn His lie Leu His 
85 9° 93 

Leu Pro Gly Cys lie Pro Cys Val Lys Thr Gly Asn Gin Ser Arg Cys 
100 105 H° 
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Trp Val Ala Leu Thr Pro Thr Leu Ala Ala Pro His Leu Arg Ala Pro 
115 120 125 

Leu Ser Ser Leu Arg Ala His Val Asp Leu Met Val Gly Ala Ala Thr 

130 135 140 

Ala Cys Ser Ala Phe Tyr He Gly Asp Leu Cys Gly Gly Val Phe Leu 
145 150 155 ISO 

Ala Gly Gin Leu Phe Thr He Arg Pro Arg He His Glu Thr Thr Gin 
165 170 175 

Asp Cys Asg Cys Ser He Tyr Ser Gly His He Thr Gly Xaa Xaa Xaa 
180 185 190 

Xaa 

(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

( ix) FEATURE : 

(A) NAME / KEY : CDS 
(3) LOCATION: 1 . . 578 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

GCGTGCGGCT TCGCCGATCT CAT GGG AT AC ATCCCGCTCG TAGGCGCCCC CGTGGGTGGC 6 0 

GTCGCCAGAG CCCTGGCGCA CGGTGTTAGG GCTGTGGAGG ACGGGATTAA CTACGCAACA 120 

GGGAATCTTC CTGGTTGCTC TTTCTCTATC TNCCTTCTGG CACTTCTCTC GTGCCTGACT 18 0 

GTCCCGGCCT CGGCTCAGCA CTACCGGAAT GTCTCGGGCA TCTACCACGT CACCAATGAT 24 0 

TGCCCGAATT CCAGCATAGT GTATGAAGCC GATCACCACA TCATGCACTT ACCAGGGTGC 3 00 

ATACCCTGCG TGAGGACCGG GAACGTTTCG" CGCTGCTGGG TATCTCTGAC ACCTACTGTG 3 60 

GCTGCTCCCT ACCTCGGGGC TCCGCTTACG TCGCTACGGC GGCATGTGGA TTTGATGGTG 4 20 

GGTGCAGCCA CCCTTTGCTC TGCCCTCTAC GTCGGAGACC TCTGTGGAGG TGTCTTCCTA 4 80 
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GTGGGACAGA TGTTCACCTT CCAGCCGCGC CGCCACTGGA CCACTCAGGA CTQCAACTGC 540 
TCCATTTACG TCGGCCACAT CACAGGCCAC AGAATGGCT 579 

(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



111 



(xi) SEQUENCE DESCRIPTION: SSQ ID NO: 13 2: 

Ala Cys Gly Phe Ala Asd Leu Me. Gly Tyr He Pro Leu Val Gly Ala 
1 5 10 13 

Y Gly Val Ala Arc Ala Leu Ala His Gly Val Arg Ala Val 



Pro Val Gly Gly 
20 



23 



hr Glv Asn Leu Pro Gly Cys Ser Phe 
0 ' 45 

Ser lie Xaa Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 



Glu Asp Gly He Asn Tyr Ala T! 
35 



50 



n Ala Gin His Tyr Arg Asn Val Ser Gly lie Tyr His Val Thr Asr. Asp 

S 65 70 75 

0 Cys Pre Asn Ser Ser He Val Tyr Glu Ala Asp His His lie Mec His 

H= 85 90 93 

Leu P-o Gly Cys He Pro Cys Val Arg Thr Gly Asn Val Ser Arg Cys 
100 «5 H° 



Tro 



Val Ser Leu Thr Pro Thr 



r Val Ala Ala Pro Tyr Leu Gly Ala Pro 



120 



115 

Leu Thr Ser Leu Arg Arg His Val Asp Leu Met Val Gly Ala Ala Thr 

130 135 140 

Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val' Phe Leu 
145 150 155 

Val Gly Gin Mec Phe Thr Phe Gin Pro Arg Arg His Trp Thr Thr Gin 
155 1 7 ° 17 

Asp Cys Asn Cys Ser lie Tyr Val Gly His He Thr Gly His Arg Mec 



125 



1B0 

Ala 



185 
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(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 79 base pairs 

(B) TYPE: nucleic acid 

(C) STRANEEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE : NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .579 



(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1..579 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

ACC TGC GGC TTT GCC GAC CTC ATG GGA TAC ATC CCG CTC GTA GGC GCC 4 8 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Leu Val Gly Ala 
15 10 15 

CCT GTG GGT GGC GTC GCC AGG GCC CTA GAA CAC GGT GTT AGG GCT GTG 96 
Pro Val Gly Gly Val Ala Arg Ala Leu Glu His Gly Val Arg Ala Val 
20 25 30 

GAG GAC GGT ATT AAT TAT GCA ACA GGG AAT CTC CCC GGT TGC TCT TTT 144 
Glu Asp Gly lie Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

TCT ATC TCC CTC TTG GCA CTT CTT TCG TGC CTG ACT GTT CCC ACC TCA 192 
Ser lie Ser Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 
50 55 60 

GCC GTC AAC TAT CGC AAC GCC TCG GGC GTC TAT CAT ATC ACC AAT GAC 240 
Ala Val Asn Tyr Arg Asn Ala Ser Gly Val Tyr Kis He Thr Asn Asp 
65 70 75 80 

TGC CCG AAT TCG AGC ATA GTG TAC GAG GCT GAC TAC CAC ATC CTA CAC 28 3 

Cys Pro Asn Ser Ser He Val Tyr Glu Ala Asp Tyr His He Leu His 
35 90 95 

CTC CCT GGG TGC TTA CCC TGC GTG AGG GTT GGG AAT CAG TCA CGC TGC 33 5 

Leu Pro Gly Cys Leu Pro Cys Val ~Arg Val Gly Asn Gin Ser Arg Cys 
100 105 110 

TGG GTG GCC CTT ACT CCC ACC GTG GCG GCG CCT TAC GTT GGT GCT CCG 3 34 

Trp Val Ala Leu Thr Pro Thr Val Ala Ala Pro Tyr Val Gly Ala Pro 
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11s 



120 125 



CTA GAA TCC CTC CGG AGT CAT GTG GAT CTG ATG GTA GGT GCT GCT ACT 
Leu Glu Ser Leu Arg Ser His Val Asp Leu Met Val Gly Ala Ala Thr 
130 135 140 

GTG TGC TCC GCT CTT TAC ATC GGG GAC CTG TGC GGT GGC GTA TTT TTG 
Va- Cys Ser Ala Leu Tyr lie Gly As? Leu Cys Gly Gly Val ?he Leu 
14 ; ISO 155 1S0 

G~" GGT CAG ATG TTT TCT TTC CAG CCG CGA CGC CAC TGG ACC ACS CAG 5. 

Vai G i y Gin Me- Phe Ser Phe Gin Pro Ar 3 Arg His Trp Thr Thr Gin 

1S5 170 175 

GAC TGC AAT TG~ TCT ATC TAC GCG GGG CAC GTT ACG GGC CAC AGG ATG 576 

Asp Cys Asn Cys Ser He Tyr Ala Gly His Val Thr Gly His Arg Met: 



180 135 190 



GCA 
Ala 



(2) INFORMATION FOR SEQ ID NC : 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
i-R (xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 



Th- Cys Glv Phe Ala As? Leu Met Gly Tyr lie Pro Leu Val Gly Ala 
*I 5 1° 15 

P-o Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
20 25 30 

Glu Asp Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 



35 40 45 



Se~ He Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 
50 55 SO 



Ala Val Asn Tyr Arg Asn Ala Ser Gly lie Tyr His lie Thr Asn Asp 
65 70 " 

Cys Pro Asn Ala Ser He Val Tyr Glu Thr Glu Asn His He Leu His 
85 90 95 

Leu Pro Gly Cys Val Pro Cys Val Arg Thr Gly Asn Gin Ser Arg Cys 
100 105 

Trp Val Ala Leu Thr Pro Thr Val Ala Ser Pro Tyr Ala Gly Ala Pro 

115 120 125 

Leu Glu Pro Leu Arg Arg Hi. Val Asp Leu Met Val Gly Ala Ala Thr 



130 



135 



140 



- 579 
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Met Cys Ser Ala Leu Tyr lie Gly Asp Leu Cys Gly Gly Leu Phe Leu 
145 150 155 ISO 

Val Gly Gin Met Phe Thr Phe Gin Pro Arg Arg His Trp Thr Thr Gin 
165 170 175 

Asp Cys Asn Cys Ser He Tyr Thr Gly His He Thr Gly His Arg Met 
180 1S5 190 

Ala 



(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

^ (ii) MOLECULE TYPE: protein 

^ (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1S2: 

Ala Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala 
UJ 15 10 15 

m 

fy Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 

20 25 30 



in 



Glu Asp Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

Ser He Ser Phe Trp His Phe Ser Arg Ala * Leu Ser Arg Pro Arg 
50 55 60 

Leu Ser Thr Thr Gly Met Ser Arg Ala Ser Thr Thr Ser Pro Met lie 
65 70 75 80 

Ala Arg He Pro Ala * Cys Met Lys Pro lie Thr Thr Ser Cys Thr 
8S 90 95 

Tyr Gin Gly Ala Tyr Pro Ala * Gly Pro Gly Thr Phe Arg Ala Ala 
100 105 110 

Gly Tyr Leu * His Leu Leu Trp Leu Leu Pro Thr Ser Gly Leu Arg 
115 120 125 

Leu Arg Arg Tyr Gly Gly Met Trp He * Trp Trp Val Gin Pro Pro 
130 135 140 

Phe Ala Leu Pro Ser Thr Ser Glu Thr Ser Val Glu Val Ser Ser 

14 5 15 0 155 160 

Trp Asp Arg Cys Ser Pro Ser Ser Arg Ala Ala Thr Gly Pro Leu Arg 
165 170 175 

Thr Ala Thr Ala Pro Phe Thr Ser Ala Thr Ser Gin Ala Thr Glu Trp 
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(2) INFORMATION FOR SEQ ID NO: 13 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



m 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(3) LOCATION: 1..575 

(ix) FEATURE: 

(A) NAME / KEY : ma;_?e?cide 

(3) LOCATION: I.. 576 



(xi) SEQUENCE DESCRIPTION: SEC ID NO: 18 5 



L ACT TGC GGC TTT GCC GAC CTC ATG GGA TAC ATC CCG CTC GTA GGC GCC 4 3 

w Thr Cvs Gly ?he Ala As? Leu Mec Gly Tyr lis Pro Leu Val Gly Ala 

m x ■ 5 10 15 



CCC GTG GGT GGC GTC GCC AGA GCC CTG GAA CAT GGT GTT AGG GCT GTG 96 
P-o Val Gi v Glv Val Ala Arg Ala Leu Glu His Gly Val Arg Ala Val 
20 25 30 

GAG GAC GGC ATC AAT TAT GCA ACA GGG AAT CTC CCC GGT TGC TCT TTC 144 
Glu Asp Gly He Asn Tyr Ala Thr Gly Asa Leu Pro Gly Cys Ser Phe 
35 40 45 

TCT ATC TAC CTC TTG GCA CTT CTC TCG TGC CTG ACT GTT CCC ACC TCG 192 
Ser He Tvr Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 
SO " 55 60 

GCC ATC CAC TAT CGC AAT GCC TCG GGC GTC TAC CAC GTC ACC AAT GAC 
Ala He H< s Tvr Arg Asn Ala Ser Gly Val Tyr His Val Thr Asn Asp 
65 * 70 75 80 

TGC CCG AAC TCG AGC ATA GTG TAC GAG GCC GAC CAC CAC ATC CTA CAC 
Cvs Pro Asn Ser Ser He Val Tyr Glu Ala As? His His He Leu His 
85 90 95 

Ctt CCA GGG TGC TTA CCC TGT GTG" AGG GTT GGG AAT CAG TCA CGT TGT 
Leu P-o Gly Cys Leu Pro Cys Val Arg Val Gly Asn Gin Ser Arg Cys 
100 105 HO 

TGG GTG GCC CTC TCT CCC ACC GTG GCG GCG CCT TAC ATC GGT GCT CCA 



240 



288 



336 



384 
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Trp Val Ala Leu Ser Pro Thr Val Ala Ala 
US 120 

GTT GAA TCC TTC CGG AGA CAC GTG GAC ATG 
Val Glu Ser Phe Arg Arg His Val Aso Met 

130 135 

GTG TGC TCC GCT CTC TAT ATT GGG GAC TTG 
Val Cys Ser Ala Leu Tyr He Gly Asp Leu 
145 150 

GTT GGT CAG ATG TTT TCT TTC CGG CCA CGA 
Val Gly Gin Met Phe Ser Phe Arg Pro Arg 
165 170 

GAC TGC AAT TGT TCC ATC TAC GCG GGG CAC 
Asp Cys Asn Cys Ser He Tyr Ala Gly His 
180 las 

GCA 
Ala 



Pro Tyr He Gly Ala Pro 
125 

ATG GTG GGC GCT GCT ACT 
Met Val Gly Ala Ala Thr 
140 

TGT GGT GGC GTA TTC TTG 
Cys Gly Gly Val Phe Leu 
3-55 ISO 

CGC CAC TGG ACT ACG CAG 
Arg His Trp Thr Thr Gin 
175 

ATC ACT GGC CAC GGA ATG 
He Thr Gly His Gly Met 
190 



432 



480 



528 



576 



579 



(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Thr Cys Gly Phe Ala Asp Leu Met Glv Tvr lie Pro Leu Val Gly Ala 
1 5 lo Is 

Pro Val Gly Gly Val Ala Arg Ala Leu Glu His Gly Val Arg Ala Val 
20 25 * 30 

Glu Asp Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

Ser He Tyr Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 

50 55 60 

Ala He His Tyr Arg Asn Ala Ser Gly Val Tyr His Val Thr Asn Asp 

65 70 75 80 

Cys Pro Asn Ser Ser He Val Tyr Glu Ala Asp His His He Leu His 
85 90 95 

Leu Pro Gly Cys Leu Pro Cys Val Arg Val Gly Asn Gin Ser Arg Cys 
100 -105 HO 

Trp Val Ala Leu Ser Pro Thr Val Ala Ala Pro Tyr He Gly Ala Pro 
115 120 * 125 
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Val Glu Ser Phe Arg Arg His Val Asp Het Met Val Gly Ala Ala Thr 
130 135 140 

Val Cys Ser Ala Leu Tyr He Gly Asp Leu Cys Gly Gly Val Phe Leu 
145 150 155 160 

Val Gly Gin Met Phe Ser Phe Arg Pro Arg Arg His Trp Thr Thr Gin 
165 170 175 

Asp Cys Asn Cys Ser lie Tyr Ala Gly His He Thr Gly His Gly Met 
180 135 190 

Ala 



(2) INFORMATION FOR SEQ ID NO: 187; 



01 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . . 579 



W ( ix) FEATURE : 

(A) NAME / KEY : mat_peptide 

(B) LOCATION: 1 . . 576 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 

ACT TGC GGC TTT GCC GAC CTC ATG GGA TAC ATC CCG CTC GTA GGC GCC 4 8 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Leu Val Gly Ala 
15 10 15 

CCT GTG GGT GGC GTC GCC AGG GCC CTG GCA CAC GGT GTT AGG GCT GTG 96 
Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
20 25 30 

GAG GAC GGG ATC AAT TAT GCG ACA GGG AAT CTT CCC GGT TGC TCT TTC 144 
Glu Asp Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

TCT ATC TTC CTC TTG GCA CTT CTT "TCG TGC CTG ACT GTT CCC ACC TCG l^ 2 
Ser He Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 
50 55 60 

GCC GTC AAC TAT CGC AAT GCC TCG GGC ATC TAT CAC ATC ACC AAT GAC 24 0 
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Ala Val Asn Tyr Arg Asn Ala Ser Gly He Tyr Hi s n e Thr Asn Asp 

65 70 75 80 

TGC CCG AAC TCG AGC ATA GTG TAC GAG ACC GAG CAC CAC ATC CTA CAC 
Cys Pro Asn Ser Ser He Val Tyr Glu Thr Glu His His He Leu His 
85 90 95 

CTC CCA GGG TGT TTA CCC TGC GTG AGG GTT GGG AAT CAG TCA CGC TGC 
Leu Pro Gly Cys Leu Pro Cys Val Arg Val Gly Asn Gin Ser Arg Cys 
100 105 HO 

TGG GTG GCC CTC ACT CCC ACC GTG GC3 GCG CCT TAC ATC GGC GCT CCG 33. 
Trp Val Ala Leu Thr Pro Thr Val Ala Ala Pro Tyr He Gly Ala Pro 
115 120 125 

CTT GAA TCC CTC CGG AGT CAT GTG GAT CTG ATG GTA GGT GCC GCT ACT 432 
Leu Glu Ser Leu Arg Ser His Val Asp Leu Met Val Gly Ala Ala Thr 
130 135 140 

' GCG TGC TCC GCT CTT TAC ATC GGA GAC CTG TGC GGT GGC GTA TTT TTG 43 0 

Ala Cys Ser Ala Leu Tyr He Gly Asp Leu Cys Gly Gly Val Phe Leu 
145 150 155 160 

GTT GGT CAG ATG TTC TCT TTC CAG CCG CGG CGC CAC TGG ACT ACS CAG 52 3 

Val Gly Gin Mez Phe Ser Phe Gin Pro Arg Arg Kis Trp Thr Thr Gin 
155 170 175 

GAC TGC AAT TGT TCC ATC TAC GCG GGG CAC GTT ACG GGC CAC AGG ATG 57 € 

Asp Cys Asn Cys Ser He Tyr Ala Gly His Val Thr Gly His Arg Met 
190 135 190 



GCA 
Ala 



(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186 : 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala 
1 5 10 15 

Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
20 25 30 

Glu Asp Gly He Asn Tyr Ala Thr -Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

Ser He Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 
50 55 60 



579 
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Ala Val Asn Tyr Arg Asn Ala Ser Gly He Tyr His He Thr Asn Asp 



65 



70 



75 



80 



Cys Pro Asn Ser Ser He Val Tyr 



Glu Thr Glu His His He Leu His 



85 



90 



95 



Leu Pro Gly Cys Leu Pro Cys Val Arg Val Gly Asn Gin Ser Arg Cys 
100 105 HO 

Trp Val Ala Leu Thr Pro Thr Val Ala Ala Pro Tyr He Gly Ala Pro 

US 120 125 

Leu Glu Ser Leu Arg Ser His Val Asp Leu Met Val Gly Ala Ala Thr 
130 135 140 

Ala Cys Ser Ala Leu Tyr He Gly Asp Leu Cys Gly Gly Val Phe Leu 



145 



150 



155 



160 



Val Gly Gin Met 



Phe Ser Phe Gin Pro Arg Arg His Trp Thr Thr Gin 



165 



170 



175 



Asp Cys Asn Cys Ser lie Tyr Ala Gly His Val Thr Gly His Arg Met 
180 195 190 



Ala 



(2) INFORMATION FOR SEQ ID NO: 18 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 57 9 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . . 579 

(ix) FEATURE: 

(A) NAME / KEY : mat_peptide 

(B) LOCATION: 1 . .5 76 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

ACG TGC GGC TTC GCC GAC CTC ATG GGA TAC ATC CCG CTC GTG GGC GCC 
Thr Cys Gly Phe Ala Aso Leu Met Gly Tyr He Pro Leu Val Gly Ala 
15 10 15 
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CCC GTT GGG GGC GTC GCC AGG GCC CTG GOG CAT GGC GTC AGG GCT GTG 
Pro Val Gly Gly Val Ala Arg Ala Ijeu Ala His Gly Val Arg Ala Val 
20 25 30 



96 



GAG GAC GGG ATT AAC TAT GCG ACA GGG AAT CTT CCC GGT TGC TCT TTC 
Glu Asp Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 



144 



TCT ATC TTC CTC CTG GCA CTT CTT TCG TGC CTC ACT GTC CCA GCG TCA 
Ser He Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 55 60 



192 



GCT GAG CAC TAC CGG AAT GCT TCG GGC ATC TAT CAC ATC ACC AAT GAC 
Ala Glu Kis Tyr Arg Asn Ala Ser Gly He Tyr His He Thr Asa Asp 
65 70 75 90 



240 



TGT CCG AAT TCC AGC GTA GTC TAT GAA ACT GAC CAC CAT ATA TTG CAC 
Cys Pro Asn Ser Ser Val Val Tyr Glu Thr Asp His His He Leu His 
85 90 95 



283 



TTG CCG GGG TGC GTA CCC TGC GTG AGG GCC GGG AAC GTG TCT CGT TGC 
Leu Pro Gly Cys Val Pro Cys Val Arg Ala Gly Asn Val Ser Arg Cys 
100 105 110 



336 



TGG ACG CCG GTA ACA CCT ACG GTG GCT GCC GTA TCC ATG GAC GCT CCG 
Trp Thr Pro Val Thr Pro Thr Val Ala Ala Val Ser Mer Asp Ala Pro 
115 120 125 



384 



CTC GAG TCC TTC CGG CGG CAT GTG GAC CTA ATG GTA GGT GCG GCC ACC 4 32 

Leu Glu Ser Phe Arg Arg His Val Asp Leu Met: Val Gly Ala Ala Thr 
130 135 140 

GTG TGT TCT GTC CTC TAT GTT GGA GAC CTC TGT GGA GGT GCT TTC CTA 4 80 

Val Cys Ser Val Leu Tyr Val Gly Asp Leu Cys Gly Gly Ala Phe Leu 
145 150 155 160 



GTG GGG CAG ATG TTC ACC TTC CAG CCG CGT CGC CAC TGG ACC ACG CAG 
Val Gly Gin Met Phe Thr Phe Gin Pro Arg Arg His Trp Thr Thr Gin 
165 170 175 



528 



GAT TGT AAT TGC TCC ATC TAT ACT GGC CAT ATC ACC GGC CAC AGG ATG 
Asp Cys Asn Cys Ser He Tyr Thr Gly His He Thr Gly Kis Arg Mec 
180 135 190 



576 



GCG 
Ala 



579 



(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 
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(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 190 : 

Thr Cys Gly Phe Ala Asp Leu Mec Gly Tyr lie Pro Leu Val Gly Ala 
! 5 10 I 5 

Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 
20 25 30 

Glu Asp Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

Ser He Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 55 60 

A^a Glu Tyr Arg Asn Ala Ser Gly He Tyr His He Thr Asn As? 

65 70 75 30 

Cvs Pro Asn Ser Ser Val Val Tyr Glu Thr Asp His His He Leu His 
85 90 95 

Leu P-o Gly Cvs Val Pro Cys Val Arg Ala Gly Asn Val Ser Arg Cys 
100 105 HO 

^ro Th- Pro Val Thr Pro Thr Val Ala Ala Val Ser Mec Asp Ala Pro 
^ * 113 120 125 

Leu Glu Ser Phe Arg Arg His Val As? Leu Met Val Gly Ala Ala Thr 
130 135 140 

Vai cys Ser Val Leu Tyr Val Gly As? Leu Cys Gly Gly Ala Phe Leu 

150 155 ISO 

Val G iy Gin Met Phe Thr Phe Gin Pro Arg Arg His Trp Thr Thr Gin 
1S5 I 70 175 

Asp Cys Asn Cys Ser He Tyr Thr Gly His He Thr Gly His Arg Mec 
180 ia5 190 

Ala 



(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 9 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 
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(B) LOCATION: 1. .289 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1. .286 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

ATG AGC ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA CGT AAC ACC AAC 48 
Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

CGC CGC CCC ATG GAC GTT AAG TTC CCG GGC GGT GGC CAG ATC GTT GGT 96 
Arg Arg Pro Met Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Giy 
20 25 30 

GGA GTT TAG TTG TTG CCG CGC AGG GGC CCC AGG TTG GGT GTG CGC GCG - 144 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

ACT AGG AAG ACT TCG GAG CGG TCG CAA CCT CGT GGG AGA CGT CAG CCT 19 2 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 SO 

ATC CCC AAG GCA CGT CGA TCT GAG GGA AGG TCC TGG GCT CAG CCC GGG 24 0 

lie Pro Lys Ala Arg Arg Ser Glu Gly Arg Ser Trp Ala Gin Pro Gly 
65 70 75 80 

TAC CCA TGG CCT CTT TAC GGT AAT GAG GGT TGT GGG TGG GCA GGA TGG G 2 89 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Giy Trp 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

Men Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

Arg Arg Pro Met Asp Val Lys Phe Pro Gly Gly Giy Gin lie Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Giy Pro Arg Leu Giy Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Giy Arg Arg Gin Pro 
50 55 60 

lie Pro Lys Ala Arg Arg Ser Glu Gly Arg Ser Trp Ala Gin Pro Gly 
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65 



70 75 80 



Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 
85 90 95 



(2) INFORMATION FOR SZZ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



01 
j fs 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..498 

(ix) FEATURE: 

(A) NAME /KEY: mat _peptide 

(B) LOCATION: 1 . .495 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 



ATC CCC AAG GCG CGC CGG TCC GAG GGC AGG TCC TGG GCG CAA GCC GGG 
lie Pro Lys Ala Arg Arg Ser Glu Gly Arg Ser Trp Ala Gin Ala Gly 
65 70 75 80 

TAC CCC TGG CCC CTC TAT GGC AAT GAG GGC TGT GGG TGG GCA GGG TGG 
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 
85 90 95 

CTC CTG TCT CCT CGC GGC TCT CGG CCA TCT TGG GGC CCA AAT GAT CCC 



43 



ATG AGC ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA CGT AAC ACC AAC 
Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
1 5 10 15 

CGC CGC CCT ATG GAC GTA AAG TTC CCG GGC GGT GGA CAG ATC GTT GGC 
Arg Arg Pro Met Ast> Vai Lys Phe Pro Gly Gly Gly Gin He Val Gly 
20 25 30 

GGA GTT TAC TTG TTG CCG CGC AGG GGC CCC CGG TTG GGT GTG CGC GCG 
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

ACT CGG AAG ACT TCG GAG CGG TCG CAA CCT CGT GGC AGG CGT CAA CCT 192 
Thr Arg Lvs Thr Ser Glu Axg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 * 55 60 



96 



144 



240 



283 



336 
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Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 
100 105 no 



CGG CGG AGA TCG CGC AAT CTG GGT AAG GTC ATC GAT ACC CTG ACG TGC 
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 
115 120 " 125 



384 



GGC TTC GCC GAC CTC AT3 GGA TAC ATC CCG CTC GTG GGC GCC CCC GTC 
Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala Pro Val 
130 135 140 



432 



GGG GGC GTC GCC AGG GCC CTG GCG CAT GGC GTC AGG GCT GTG GAG GAC 
Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val Glu Asp 
145 150 155 160 



480 



GGG ATT AAC TAT CGA CAG 
Gly He Asn Tyr Arg Gin 
165 



498 



{2} INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 165 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

Arg Arg Pro Met Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

He Pro Lys Ala Arg Arg Ser Glu Gly Arg Ser Trp Ala Gin Ala Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 
100 105 110 

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He As? Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala Pro Val 
130 135 140 
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Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val Glu Asp 
145 150 155 ISO 

Gly lie Asn Tyr Arg Gin 
165 



(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 
(3) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



03 (ix) FEATURE : 

Si (A) NAME /KEY: CDS 

yj (3} LOCATION: 1. .579 



i i 



(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1. .576 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

ACG TGC GGA TTC GCC GAC CTC GTG GGG TAC ATC CCG CTC GTA GGC GGC 4 8 

Thr Cys Gly Phe Ala Asp Leu Val Gly Tyr lie Pro Leu Val Gly Gly 
15 10 15 

CCC GTT GGG GGC GTC GCA AGG GCT CTC GCA CAT GGT GTG AGG GTT CTT 96 
Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu 
20 25 30 

GAG GAC GGG GTG AAT TAT GCA ACA GGG AAT CTG CCT GGT TGC TCT TTC 14 4 

Glu Asp Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

TCT ATC TTC ATT CTT GCA CTT CTC TCG TGC CTC ACT GTC CCG GCC TCT 19 2 

Ser lie Phe lie Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 55 60 

GCA GTT CCC TAC CGA AAT GCC TCT GGG ATC TAT CAT GTC ACC AAT GAT 24 0 

Ala Val Pro Tyr Arg Asn Ala Ser Gly lie Tyr His Val Thr Asn Asp 
65 70 75 80 

TGC CCA AAC TCT TCC ATA GTC TAT "GAG GCA GAT GAT CTG ATC CTA CAC 28 8 

Cys Pro Asn Ser Ser lie Val Tyr Glu Ala Asp Asp Leu lie Leu Kis 
35 90 95 

GCA CCT GGC TGC GTG CCT TGT GTC AGG AAA GAT AAT GTG AGT AGG TGC 3 36 
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Ala Pro Gly Cys Val Pro Cys Val Arg Lys Asp Asn Val Ser Arg Cys 
100 105 no 

TGG GTC CAA ATT ACC CCC ACG CTG TCA GCC CCG AGC TTC GGA GCA GTC 3 84 

Trp Val Gin lie Thr Pro Thr Leu Ser Ala Pro Ser Phe Gly Ala Val 
115 120 125 



ACG GCT CCC CTT CGG AGA GCC GTC GAT TAC TTG GTG GGA GGG GCT GCC 432 
Thr Ala Pro Leu Arg Arg Ala Val Asp Tyr Leu Val Gly Gly Ala Ala 
130 135 140 

CTC TGC TCC GCG TTA TAC GTT GGA GAC GCG TGT GGG GCA CTA TTT TTG 480 
Leu Cys Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe Leu 
145 150 155 ISO 

GTA GGC CAA ATG TTC ACC TAT AGG CCT CGC CAG CAT GCT ACG GTG CAG 523 
Val Gly Gin Met Phe Thr Tyr Arg Pro Arg Gin His Ala Thr Val Gin 

165 170 175 

GAC TGC AAC TGT TCC ATC TAC AGT GGC CAC GTC ACC GGC CAT CAG ATG 576 
Asp Cys Asn Cys Ser lie Tyr Ser Gly His Val Thr Gly His Gin Met 
180 185 190 

GCA 5 79 

Ala 



fU (2) INFORMATION FOR SEQ ID NO: 196: 

g " U) SEQUENCE CHARACTERISTICS: 

H (A) LENGTH: 193 amino acids 

ff\ (3) TV PE : amino acid 

™- (D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

Thr Cys Gly Phe Ala Asp Leu Val Gly Tyr lie Pro Leu Val Gly Gly 
15 10 15 

Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu 
20 25 30 

Glu Asp Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
35 40 45 

Ser lie Phe lie Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 
50 55 60 

Ala Val Pro Tyr Arg Asn Ala Ser Gly He Tyr His Val Thr Asn Asp 
65 * 70 75 80 

Cys Pro Asn Ser Ser He Val Tyr Glu Ala Asp Asp Leu He Leu His 
85 90 95 

Ala Pro Gly Cys Val Pro Cys Val Arg Lys Asp Asn Val Ser Arg Cys 
100 105 HO 
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Trp Val Gin He Thr Pro Thr Leu Ser Ala Pro Ser Phe Gly Ala Val 

115 120 
Thr Ala Pro Leu Arg Arg Ala Val Asp Tyr Leu Val Gly Gly Ala Ala 

130 135 

va^ r-»v \<=>s Ala Ors Gly Ala Leu Phe Leu 
Leu Cys Ser Ala Leu Tyr Va_ G-y Asp A_a c/s * y 

145 155 

Val Gly Gin Met Phe Thr Tyr Arg Pro Arg Gin His Ala Thr Val Gin 
1S5 170 1 



18S 



Asp Cys Asn Cys Ser lie Ty 
180 

Ala 



(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 579 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1 . - 579 

(ix) FEATURE: 

(A) NAME/KEY: ma -_pept xde 
(3) LOCATION: 1. .576 



r Ser Gly His Val Thr Gly His Gin Mec 



190 



(XM SEQUENCE DESCRIPTION: SEQ ID NO: 197: 

S 5 5 = S Z £ £ Z S S Z S S S S 
S S S 5 S S 5 S S K SI 5 S S E S 

20 25 



a— ^ tat G C^ ACA ~GGG AAT CTC CCC GGT TGC TCT TTC 

Giu Z t% » ^ -« - «t - - - «v ^ •« «- 



35 40 



43 



96 



144 



192 
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Ser lie Tyr 
50 

GCC ATC CAC 
Ala He His 
65 

TGC CCG AAC 
Cys Pro Asn 



CTT CCA GGG 
Leu Pro Gly 



TGG GTG GCC 
Trp Val Ala 
115 



Leu Leu 



TAT CGC 
Tyr Arg 



TCG AGC 
Ser Ser 
85 

TGC TTA 
Cys Leu 
100 

CTC TCT 
Leu Ser 



Ala Leu 
55 

AAT GCC 
Asn Ala 
70 

ATA GTG 
He Val 



CCC TGT 
Pro Cys 



CCC ACC 
Pro Thr 



248 

Leu Ser Cys 



TCG GGC GTC 
Ser Gly Val 



TAC GAG GCC 
Tyr Glu Ala 
90 

GTG AGG GTT 
Val Arg Val 
105 

GTG GCG GCG 
Val Ala Ala 
120 



Leu Thr Val 
60 

TAC CAC GTC 
Tyr His Val 
75 

GAC CAC CAC 
Asp His His 



GGG AAT CAG 
Gly Asn Gin 



CCT TAC ATC 
Pro Tyr lie 
125 



PCT/EP94/01323 



Pro Thr Ser 



ACC AAT GAC 
Thr Asn Asp 
80 

ATC CTA CAC 
He Leu His 
95 

TCA CGT TGT 
Ser Arg Cys 
110 

GGT GCT CCA 
Gly Ala Pro 



240 



238 



336 



384 



GTT GAA TCC TTC CGG AGA CAC GTG GAC ATG ATG GTG GGC GCT GCT ACT 4 32 

y Val Glu Ser Phe Arg Arg His Val Asp Met Met Val Gly Ala Ala Thr 

d 130 135 140 

S.JL5 

SJ GTG TGC TCC GCT CTC TAT ATT GGG GAC TTG TGT GGT GGC GTA TTC TTG 430 

Id Val Cys Ser Ala Leu Tyr lie Gly Asp Leu Cys Gly Gly Val Phe Leu 

fl| 145 150 1S5 160 

i 

f y 

£ GTT GGT CAG ATG TTT TCT TTC CGG CCA CGA CGC CAC TGG ACT ACG CAG 528 

J" Val Gly Gin Met Phe Ser Phe Arg Pro Arg Arg His Trp Thr Thr Gin 

L 165 170 175 

GAC TGC AAT TGT TCC ATC TAC GCG GGG CAC ATC ACT GGC CAC GGA ATG 5 76 

U Asp Cys Asn Cys Ser lie Tyr Ala Gly His He Thr Gly His Gly Met 

Ifl 130 135 190 

|«fc GCA 579 

Ala 



(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Leu Val Gly Ala 
15 10 15 

Pro Val Gly Gly Val Ala Arg Ala Leu Glu His Gly Val Arg Ala Val 
20 25 30 

Glu Asp Gly He Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 
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35 40 45 

Ser lie Tyr Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 
SO 55 60 

Ala lie His Tyr Arg Asn Ala Ser Gly Val Tyr His Val Thr Asn Asp 
65 70 75 80 

Cys Pro Asn Ser Ser lie Val Tyr Giu Ala Asp His His lie Leu His 
85 90 95 

Leu Pro Gly Cys Leu Pro Cys Val Arg Val Gly Asn Gin Ser Arg Cys 
100 105 110 

Trp Val Ala Leu Ser Pro Thr Val Ala Ala Pro Tyr lie Gly Ala Pro 
115 120 125 

Val Glu Ser Phe Arg Arg His Val Asp Met Met Val Gly Ala Ala Thr 
130 135 140 

Val Cys Ser Ala Leu Tyr lie Gly Asp Leu Cys Gly Gly Val Phe Leu 
145 150 155 160 

Val Gly Gin Met Phe Ser Phe Arg Pro Arg Arg His Trp Thr Thr Gin 
165 170 175 

Asp Cys Asn Cys Ser He Tyr Ala Gly His He Thr Gly His Gly Met 
180 195 190 

Ala 



(2) INFORMATION FOR SEQ ID NO : 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1470 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL : NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(3) LOCATION: 2. .1470 

(ix) FEATURE: 

(A) NAME /KEY : mat_pep t ide 

(B) LOCATION: 2. .1467 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 
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PC 



A TCA CCA CCG GAG CTT CTA TCA CAT ACT CCA CTT ACG GCA AGT TCC 
Ser Pro Pro Glu Leu Leu Ser His Thr Pro Leu Thr Ala Ser Ssr 
1 5 10 is 

TTG CTG ATG GAG GGT GTT CAG GCG GCG CGC ATG ACG TGA TCA TAT GCG 
Leu Leu Men Glu Gly Val Gin Ala Ala Arg Mec Thr * Ser Tyr Ala 
20 25 30 

ACG AGT GCC ATT CCC AGG ACG CCA CCA CCA TTC TTG GGA TAG GCA CTG 
Thr Ser Ala lie Pro Arg Thr Pro Pro Pro Phe Leu Gly * Ala Leu 
35 40 45 



TCC TTG ACC AGG CAG AGA CGG CTG GAG CTA GGC TCG TCG TCT TGG CCA 
Ser Leu Thr Arg Gin Arg Arg Leu Glu Leu Gly Ser Ser Ser Trp Pro 
50 55 60 



190 



CGG CCA CCC CTC CCG GCA GTG TGA CAA CGC CCC ACC CCA ACA TCG AGG 
Arg Pro Pro Leu Pro Ala Val * Gin Arg Pre Thr Pro Thr Ser Arg 
65 70 75 



233 



^1 



AAG TGG CCC TGC CTC AGG AGG GGG AGG TTC CCT TCT ACG GCA GAG CCA 
Lys Trp Pro Cys Leu Arg Arg Gly Arg Phe Pre Ser Thr Ala Glu Pro 
80 85 90 95 



296 



TTC CCC TTG CTT TTA TAA AGG GTG GTA GGC ATC TCA TCT TCT GCC ATT 
Phe Pro Leu Leu Leu * Arg Val Val Gly lie Ser Ser Ser Ala lie 
100 105 HQ 



334 



y - 

Q 
in 



CCA AGA AAA AAT GTG ATG AAC TCG CCA AGC AAC TGA CCA GCC TGG GCG 
Pro Arg Lys Asn Val Met Asn Ser Pro Ser Asn * Pro Ala Trp Ala 
115 120 125 

TGA ACG CCG TGG CAT ATT ATA GAG GTC TAG ACG TCG CCG TCA TAC CCA 
* Thr Pro Trp His lie lie Glu Val * Thr Ser Pro Ser Tyr Pro 
130 135 140 

CAA CAG GAG ACG TGG TCG TGT GCA GCA CCG ACG CGC TCA TGA CGG GAT 
Gin Gin Glu Thr Trp Ser Cys Ala Ala Pro Thr Arg Ser * Arg Asp 
145 150 155 

TCA CCG GCG ACT TTG ATT CTG TCA TAG ACT GCA ACT CCG CCG TCA CTC 
Ser Pro Ala Thr Leu He Leu Ser * Thr Ala Thr Pro Pro Ser Leu 
160 165 170 175 

AGA CGG TGG ACT TCA GTC TGG ATC CCA CTT TTA CCA TTG AGA CTA CCA 
Arg Arg Trp Thr Ser Val Trp He Pro Leu Leu Pro Leu Arg Leu Pro 
130 135 190 

CAG TGC CCC AGG ACG CAG TGT CCA GAA GCC AGC GTT GGG GCC GCA CGG 
Gin Cys Pro Arg Thr Gin Cys Pro Glu Ala Ser Val Gly Ala Ala Arg 
195 200 205 

GGA GAG GTA GGC ACG GCA TAT ACC GGT ATG TCT CGG CTG GAG AGA GAC 
Gly Glu Val Gly Thr Ala Tyr Thr* Gly Met Ser Arg Leu Glu Arg Asp 
210 * 215 220 

CGT CTG GCA TGT TCG ACT CCG TGG TGC TCT GTG AGT GCT ACG ATG CCG 
Arg Leu Ala Cys Ser Thr Pro Trp Cys Ser Val Ser Ala Thr Me- Pro 



382 



430 



478 



526 



574 



622 



670 



718 
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225 



230 



235 



r 5 ^ 



GAT GTG CAT GGT ACG ATC TGA CTC CTG CCG AGA CTA CCG TGA C5Q T TGC 
Asp Val His Gly Thr He * Leu Leu Pro Arg Leu Pro - Gly cys 
240 245 250 255 

GCG CTT ACT AAA CAC CCC CGG GCT CCC TGT CTG TCA GGA CCA TTT GGA 
Ala Leu Thr Lys His Pro Arg Ala Pro Cys Leu Ser Gly ?" ?ha Gly 
2S0 255 270 

ATT CTG GGA GGG GGT GTT CAC GGG GCT CAC TAA CAT CGA CGC TCA CAT 
He Leu Gly Gly Gly Val His Gly Ala His * His Arg Arg Ser His 
275 230 285 

GCT GTC ACA GAC CAA ACA GGG TGG GGA GAA TTT CCC ATA CCT TGT AGC 
Ala Val Thr Asd Gin Thr Gly Tr? Gly Glu Phe Pro He Pro Cys Ser 
290 " 295 300 

GTA CCA AGC AAC AGT CTG TGT TCG CGC GAA AGC GCC CCC CCC GAG CTG 
Val Pro Ser Asn Ser Leu Cys Ser Arg Glu Ser Ala Pro Pro Gin Leu 
305 310 315 

GGA CAC AAT GTG GAA ATG CAT GCT CCG TCT CAA ACC GAC TTA ACT GC-C 
Giy His Asn Val Glu Met His Ala Pro Ser Gin Thr As? Leu Thr Gly 
320 325 330 333 

CT ACT CCC CTC TTG TAC AGG CTG GGG CCC GTC CAG AAT GAG ATC ACA 
P-o Thr Pro Leu Leu Tyr Arg Leu Gly Pro Val Gin Asn Glu He Thr 
340 345 350 

CTG ACG CAC CCC ATC ACC AAG TAC ATT ATG GCT TGC ATG TCT GCG GAC 
Leu Thr His Pro He Thr Lys Tyr He Met Ala Cys Met Ser Ala Asp 
355 3S0 365 

TTG GAG GTC ATT ACC AGC ACT TGG GTT CTG GTG GGG GGC GTT GTG GCG 
Leu Glu Val He Thr Ser Thr Trp Val Leu Val Gly Gly Val Val Ala 
370 375 380 

GCC CTG GCG GCC TAC TGC TTG ACG GTG GGT TCG GTA GCC ATA GTC GGT 
Ala Leu Ala Ala Tyr Cys Leu Thr Val Gly Ser Val Ala He Val Gly 
385 390 395 

AGG ATC ATC CTC TCT GGG AAA CCT GCC ATC ATT CCC GAT AGG GAG GTA 
™ He He Leu Ser Gly Lys Pro Ala He He Pro Asp Arg Glu Val 
400 405 410 = 

TTA TAC CAG CAA TTT GAT GAG ATG GAG GAG TGC TCG GCC TCG TTG CCC 
leu Sr Tin Gin Phe Asp Glu Met Glu Glu Cys Ser Ala Ser Leu Pro 

^ -5 c 4 J U 

420 42b 

r ^ r rAA TTC AAA GAG AAA GTG 

TAT ATG GAC GAA ACA CGT GCC A* : GvX GuA CAA TIL 

Tvr Mec Aso Glu Thr Arg Ala He Ala Gly Gin Phe Lys Glu Lys Va. 

' 435 "° *" 

CTC GGC TTC ATC AGC ACG ACC GGC CAG AAG GCT GAA ACT CTG AAG CCG 
Leu Gly Phe lie Ser Thr Thr Gly Gin Lys Ala Glu Thr Leu L/a 
450 455 460 



8t 



910 



953 



100S 



105- 



1102 



1150 



1198 



124c 



1294 



1342 



1390 
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GCA GCC ACG TCT GTG TGG AAC AAG GCT GAG CAG TTC TGG CCA CAT ACA 
Ala Ala Thr Ser Val Trp Asn Lys Ala Glu Gin Phe Trp Pro His Thr 
465 470 475 



1438 



TGT GGA ACT TCA TCA GTG GGA TAC AAT AAT AG 
Cys Gly Thr Ser Ser Val Gly Tyr Asn Asn 
430 485 



1470 



(2) INFORMATION FOR SEQ ID NO : 197: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1485 base pairs 
(3} TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(3) LOCATION: 1..1485 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 

TGTGCCAGGA CCATCACCAC CGGAGCTTCT ATCACATACT CCACTTACGG CAAGTTCCTT 6 0 

GCTGATGGAG GGTGTTCAGG CGGCGCGCAT GACGTGATCA TATGCGACGA GTGCCATTCC 12 0 

CAGGACGCCA CCACCATTCT TGGGATAGGC ACTGTCCTTG ACCAGGCAGA GACGGCTGGA 18 0 

GCTAGGCTCG TCGTCTTGGC CACGGCCACC CCTCCCGGCA GTGTGACAAC GCCCCACCCC 24 0 

AACATCGAGG AAGTGGCCCT GCCTCAGGAG GGGGAGGTTC CCTTCTACGG CAGAGCCATT 3 00 

CCCCTTGCTT TTATAAAGGG TGGTAGGCAT CTCATCTTCT GCCATTCCAA GAAAAAATGT 3 60 

GATGAACTCG CCAAGCAACT GACCAGCCTG GGCGTGAACG CCGTGGCATA TTATAGAGGT 42 0 

CTAGACGTCG CCGTCATACC CACAACAGGA GACGTGGTCG TGTGCAGCAC CGACGCGCTC 4 80 

ATGACGGGAT TCACCGGCGA CTTTGATTCT GTCATAGACT GCAACTCCGC CGTCACTCAG 54 0 

ACGGTGGACT TCAGTCTGGA TCCCACTTTT ACCATTGAGA CTACCACAGT GCCCCAGGAC 6 00 

GCAGTGTCCA GAAGCCAGCG TTGGGGCCGC ACGGGGAGAG GTAGGCACGG CATATACCGG 66 0 

TATGTCTCGG CTGGAGAGAG ACCGTCTGGC ATGTTCGACT CCGTGGTGCT CTGTGAGTGC 72 0 

TACGATGCCG GATGTGCATG GTACGATCTG ACTCCTGCCG AGACTACCGT GAGGTTGCGC 78 0 

GCTTACNTAA ACACCCCCGG GCTCCCTGTC TGTCAGGACC ATTTGGAATT CTGGGAGGGG 84 C 

GTGTTCACGG GGCTCACTAA CATCGACGCT CACATGCTGT CACAGACCAA ACAGGGTGGG 90 C 

GAGAATTTCC CATACCTTGT AGCGTACCAA GCAACAGTCT GTGTTCGCGC GAAAGCGCCC 96 C 
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CCCCCCAGCT GGGACACAAT GTGGAAATGC ATGCTCCGTC TCAAACCGAC NTTAACTGGC 1020 

CCTACTCCCC TCTTGTACAG GCTGGGGCCC GTCCAGAATG AGATCACACT GACGCACCCC 1080 

ATCACCAAGT ACATTATGGC TTGCATGTCT GCGGACTTGG AGGTCATTAC CAGCACTTGG 1140 

GTTCTGGTGG GGGGCST73T GGCSGCCCTG GCGGCCTACT GCTTGACGGT GGGTTCGGTA 1200 

GCCATAGTCG GTAGGATCAT CCTCTCTGGG AAACCTGCCA TCATTCCCGA TAGGGAGGTA 1260 

TTATACCAGC AATTTGATGA GATGGAGGAG TGCTCGGCCT CGTTGCCCTA TATGGACGAA 1320 

ACACGTGCCA TTGCCGGACA ATTCAAAGAG AAAGTGCTCG GCTTCATCAG CACGACCGGC 138 0 

CAGAAGGCTG AAACTCTGAA GCCGGCAGCC ACGTCTGTGT GGAACAAGGC TGAGCAGTTC 144 0 

TGGNCCACAT ACATGTGGAA CTTCATCAGT GGGATACAAT AATAG - 1495 
(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS -. 

(A) LENGTH: 4 84 amino acids 

(B) TYPE: arr.ino acid 

(C) STHANDEDNES3 : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 193: 

Cys Ala Arg Thr lie Thr Thr Gly Ala Ser lie Thr Tyr Ser Thr Tyr 
, 5 10 15 



Gly Lys Phe 



Leu Ala As? Gly Gly Cys Ser Gly Gly Ala His Asp Val 



30 



20 25 
He He Cys As? Glu Cys His Ser Gin Asp Ala Thr Thr He Leu Gly 



35 



40 



45 



He Glv Th- Val Leu Aso Gin Ala Glu Thr Ala Gly Ala Arg Leu Val 

55 €0 



50 



Val Leu Ala Th 
65 



hr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro 



70 



75 



80 



Asn lie Glu Glu Val Ala Leu Pro Gin Glu Gly Glu Val Pro Phe Tyr 

90 * 5 



85 



Gly Arg Ala He Pro L- 



su Ala Phe He Lys Gly Gly Arg His Leu lie 



100 



105 



110 



lu Leu Ala Lys Gin Leu Thr 



Ph<* Cys His Ser Lys Lys Lys Cys Asp G 

120 125 



115 



Ser Le 
130 



u Gly Val Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ala 
135 1^0 
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Val lie Pro Thr Thr Gly Asp Val Val Val Cys Ser Thr Asp Ala Leu 
145 150 155 160 

Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val lie Asp Cys Asn Ser 
155 170 175 

Ala Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr lie 
130 185 190 

Glu Thr Thr Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Trp 
195 200 205 

Gly Arg Thr Gly Arg Gly Arg Kis Gly lie Tyr Arg Tyr Val Ser Ala 
210 215 220 

Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Val Val Leu Cys Glu Cys 
225 230 235 24Q 

Tyr Asp Ala Gly Cys Ala Trp Tyr Asp Leu Thr Pro Ala Glu Thr Thr 
245 250 255 

Val Arg Leu Arg Ala Tyr Xaa Asn Thr Pro Gly Leu Pro Val Cys Gin 
260 265 270 

Asp Kis Leu Glu Phe Trp Glu Gly Val Phe Thr Gly Leu Thr Asn lie 
275 28C 235 

Asp Ala His Met Leu Ser Gin Thr Lys Gin Gly Gly Glu Asn Phe Pro 
290 295 300 

Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Val Arg Ala Lys Ala Pro 
305 310 315 320 

Pro Pro Ser Trp Asp Thr Met Trp Lys Cys Met Leu Arg Leu Lys Pro 
325 330 335 

Xaa Leu Thr Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro Val Gin 
340 345 350 

Asn Glu He Thr Leu Thr His Pro He Thr Lys Tyr He Met Ala Cys 
355 360 365 

Met Ser Ala Asp Leu Glu Val He Thr Ser Thr Trp Val Leu Val Gly 
370 375 380 

Gly Val Val Ala Ala Leu Ala Ala Tyr Cys Leu Thr Val Gly Ser Val 
385 390 395 400 

Ala lie Val Gly Arg He lie Leu Ser Gly Lys Pro Ala lie lie Pro 
405 410 415 

Asp Arg Glu Val Leu Tyr Gin Gin Phe As? Giu Met Glu Glu Cys Ser 
420 425 430 



Ala Ser Leu Pro Tyr Met Asp Glu Thr Arg Ala He Ala Gly Gin Phe 
435 440 " 445 
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Lys Glu Lys Val Leu Gly Phe lie Ser Thr Thr Gly Gin Lys Ala Glu 
450 455 460 

Thr Leu Lys Pro Ala Ala Thr Ser Val Trp Asn Lys Ala Glu Gin Phe 
465 470 47S 430 

Trp Xaa Thr Tyr 

{2} INFORMATION FOR SEC ID NO : 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1435 base pairs 
(3) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

0 (ix) FEATURE: 

Up (A} NAME /KEY : CDS 

SB (3) LOCATION: 1 . . 1435 

y 

«1 (Xi) SEQUENCE DESCRIPTION: SEC ID NO: 199 : 

f p TGTGCCAGGA CCATCACCAC CGGAGCTTCT ATCACATACT CCACTTACGG CAAGTTCCTT 6 0 

GCTGA-GGAG GGTGTTCAGG CGGCGCGTA7 GACG7GATCA TATGCGACGA GTGCCATTCC 12 0 

D 

01 CAGGACGCCA CCACCATTCT TGGGA7AGGC ACTGTCCTTG ACCAGGCAGA GACGGCTGGA 18 0 

□ 

[f! GCTAGGCTCG TCGTCTTGGC CACGGCCACC CCTCCCGGCA GTGTGACAAC GCCCCACCCC 24 0 

o 

AACATCGAGG AAGTGGCCCT GCCTCAGGAG GGGGAGGTTC CCTTCTACGG CAGAGCCATT 300 

CCCCTTGCTT TTATAAAGGG TGGTAGGCAT CTCATCTTCT GCCATTCCAA GAAAAAATGT 36 0 
GATGAACTCG CCAAGCAACT GACCAGCC7G GGCGTGAACG CCGTGGCATA TTATAGAGGT 

CTAGACGTCG CCGTCATCCC CACAG CAGGA GACGTGGTCG TGTGCAGCAC CGACGCGCTC 4 30 

ATGACGGGAT TCACCGGCGA CTTTGATTCT GTCATAGACT GCAACTCCGC CGTCACTCAG 54 0 

ACGGTGGACT TCAGTCTGGA TCCCACTTTT ACCATTGAGA CTACCACAGT GCCCCAGGAC 6 00 



GCAGTGTCCA GAAGCCAGCG TAGGGGCCGC ACGGGGAGAG GTAGGCACGG CATATACCGG 
TATGTCTCGG CTGGAGAGAG ACCNTCTGAC ATGTTCGACT CCGTGGTGCT CTGTGAGTGC 

TACGATGCCG GATGTGCGTG GTATGATCTG ACTCCTGCCG AGACTACCGT GAGGTTG CGC 

GCTTACATAA ACACCCCCGG GCTCCCTGTC TGTCAGGACC ATTTGGAATT CTGGGAGGGG 

GTGTTCACGG GGCTCACTAA CATCGACGCT CACATGCTGT CACAGACCAA ACAGGGTGGG 

GAGAATTTNC CATACCTTGT AGCGTACCAA GCAACAGTCT GTGTTCGCGC GAAAGCGCCC 



420 



660 
720 

780 

840 

900 

960 
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CCCCCCAGCT GGGACACAAT GTGGAAATGC ATGCTCCGTC TCAAACCGAC TTTAACTGGC o 

CCTACTCCCC TCTTGTACAG GCTGGGGCCC GTCCAGANTG AGATCACACT GACGCACCCC ■) 

ATCACCAAGT ACATTATGGC TTGCATGTCT GCGGACTTGG AGGTCATTAC CANCACTTGG X\ 

GTTCTGGTGG GGGGCGTTGT GGCGGCCCTG GCGGCCTACT GCTTGACGGT GGGTTCGGTA i2i 

GCCATAGTCG G TAG GAT CAT CCTCTCTGGG AAACCTGCCA TCATTCCCGA TAGGGAGGCA ±2& 

TTATACCAGC AATTTGATGA GATGGAGGAG TGCTCGGCCT CGTTGCCCTA TATGGACGAG 13 2& 

ACACGTGCCA TTGCCGGACA ATTCAAAGAG AAAGTGCTCG GCTTCATCAG CACGACCGGC i3S0 

CAGAAGGCTG AAACTCTGAA GCZGGCXGCC ACGTCTGTGT GGAACAAGGC TGAGCAGTTC 14 40 

TGGGCCACAT ACATGTGGAA CTTCATCAGC GGGATACAAT AATAG 14 8 5 



(2) INFORMATION FOR SEQ ID NO : 20 0: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 84 amino acids 
(3) TYPE : atr.i.ic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 

Cys Ala Arg Thr lie Thr Thr Gly Ala Ser He Thr Tyr Ser Thr Tyr 
15 10 15 

Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp Val 
20 25 30 

He He Cys Asp Glu Cys His Ser Gin Asp Ala Thr Thr He Leu Gly 
35 40 45 

He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val 
50 55 60 

Val Leu Ala Thr Ala Thr Pre Pro Gly Ser Val Thr Thr Pro. His Pro 
65 70 75 80 

Asn He Glu Glu Val Ala Leu Pro Gin Glu Gly Glu Val Pro Phe Tyr 
85 90 95 

Gly Arg Ala He Pro Leu Ala Phe He Lys Gly Gly Arg His Leu He 
100 105 13-0 

Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Lys Gin Leu Thr 
115 120 125 
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Ser Leu Gly Val Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ala 
130 135 140 

Val He Pro Thr Ala Gly Asp Val Val Val Cys Ser Thr Asp Ala Leu 
145 150 155 160 

Met: Thr Gly Phe Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn Ser 
1S5 170 175 

Ala Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He 

180 135 190 

Glu Thr Thr Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg 
195 200 205 

Gly Arg Thr Gly Arg Gly Arg His Gly He Tyr Arg Tyr Val Ser Ala 
210 215 220 



•-ft 



Gly Glu Arg Xaa Ser Asp Met Phe Asp Ser Val Val Leu Cys Glu Cys 
225 230 235 240 

Tyr Asp Ala Gly Cys Ala Trp Tyr Asp Leu Thr Pro Ala Glu Thr Thr 
243 250 255 

Val Arg Leu Arg Ala Tyr He Asn Thr Pro Gly Leu Pro Val Cys Gin 
260 265 270 

Asp Kis Leu Glu Phe Trp Glu Gly Val Phe Thr Gly Leu Thr Asn lie 
275 280 285 



Asp Ala His Met Leu Ser Gin Thr Lys Gin Gly Gly Glu Asn Xaa Pro 
290 295 300 

Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Val Arg Ala Lys Ala Pro 
305 310 315 320 

Pro Pro Ser Trp Asp Thr Met: Trp Lys Cys Met: Leu Arg Leu Lys Pro 
325 330 335 

Thr Leu Thr Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro Val Gin 
340 345 350 

Xaa Glu He Thr Leu Thr His Pro He Thr Lys Tyr He Mec Ala Cys 
355 360 365 

Met Ser Ala Asp Leu Glu Val He Thr Xaa Thr Trp Vai Leu Val Gly 
370 375 380 

Gly Val Val Ala Ala Leu Ala Ala Tyr Cys Leu Thr Val Gly Ser Val 
385 390 395 400 

Ala He Val Gly Arg He lie Leu Ser Gly Lys Pro Ala lie He Pro 
405 410 415 

Asp Arg Glu Ala Leu Tyr Gin Gin Phe Asp Glu Met Glu Glu Cys Ser 
420 425 430 



Ala Ser Leu Pro Tyr Met Asp Glu Thr Arg Ala He Ala Gly Gin Phe 
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435 440 445 

Lys Glu Lys Val Leu Gly Phe lie Ser Thr Thr Gly Gin Lys Ala Glu 
450 455 460 

Thr Leu Lys Pro Ala Ala Thr Ser Val Trp Asn Lys Ala Glu Gin Phe 
465 " 470 475 480 



Trp Ala Thr Tyr 
(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 0 base pairs 

(B) TYPE; nucleic acid 

( C ) STRANDEDNES S : S ing 1 e 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



^ (iii) HYPOTHETICAL: NO 

3 (iii) ANTI-SENSE : NO 



GCT GCG GGG CTG AAG GAC TGC ACC ATG CTG GTT TGC GGT GAC GAC TTA 
Ala Ala Gly Leu Lys Asp Cys Thr Met Leu Val Cys Gly Asp As? Leu 
80 85 90 95 



45 



yj (ix) FEATURE: 

nj (A) NAME / KEY : CDS 

flj (B) LOCATION: 2.. 340 

7" (ix) FEATURE: 

Ll (A) NAME / KEY : mat_peptide 

(B) LOCATION: 2. .33 7 

III (Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 201: 

C TCC ACT GTG ACT GAG AGA GAC ATC AGG GTC GAA GAA GAA GTC TAT 
Ser Thr Val Thr Glu Arg Asp lie Arg Val Glu Glu Glu Val Tyr 
1 5 10 15 

CAG TGT TGT GAT CTG GAG CCC GAG GCC CGC AAG GTA ATA ACC GCC CTC 94 
Gin Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val lie Thr Ala Leu 
20 25 30 

ACG GAG AGA CTC TAC GTG GGC GGC CCT ATG TAC AAT AGC AAG GGA GAC 14 2 

Thr Glu Arg Leu Tyr Val Gly Gly Pro Met Tyr Asn Ser Lys Gly Asp 
35 40 45 

CTT TGC GGG TAT CGC AGG TGC CGC GCA AGC GGC GTA TAT ACC ACC AGC 190 
Leu Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser 
50 55 60 

TTC GGG AAC ACA CTG ACG TGC TAC CTT AAA GCC TCA GCA GCC ATC AGG 2 38 

Phe Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala He Arg 
65 ™ 75 



236 
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GTC GTG ATC GCT GAA AGC GGT GGC GTC GAG GAG GAC AAG CGA GCC CTC 
Val Val He Ala Glu Ser Gly Gly Val Glu Glu Asp Lys Arg Ala Leu 
100 105 HO 



334 



340 



GGA GCT 
Gly Ala 



(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 
I Ser Thr val Thr Glu Arg Asp He Arg Val Glu Glu Glu Val Tyr Gin 

^ Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val He Thr Ala Leu Thr 

iy 20 25 



Glu Arg Lau Tvr Val Gly Gly Pro Me, Tyr Asn Ser Lys Gly Asp Leu 

35 * 40 , 43 

Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 

50 53 
Gly Asa Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala He Arg Ala 

65 70 7= 

Ala Gly Leu Lys Asp Cys Thr Mec Leu Val Cys Gly As? Asp Leu Val 



85 90 



Val He Ala Glu Ser Gly Gly Val Glu Glu Asp Lys Arg Ala Leu Gly 

ioo i° 5 110 

Ala 

(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 340 

(ix) FEATURE: 

(A) NAME/KEY: mac_peptide 

(3) LOCATION: 2. .337 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 

C TCC ACA GTG ACT GAA AGA GAC ATC AGG GTC GAG GAA GAG GTC TAG 4 6 

Ser Thr Val Thr Glu Arg Asp He Arg Val Glu GIu Glu Val Tyr 
15 10 15 

CAG TGT TGT GAC CTG GAG CCT GAA ACC CGC AAG GTA ATA TCT GCC CTC 94 
Gin Cys Cys Asp Leu Glu Pro Glu Thr Arg Lys Val He Ser Ala Leu 
20 25 30 

ACT GAA AGA CTC TAT GTG GGC GGT CCC ATG CAC AAC AGC AGG GGA GAC 14 2 

Thr Glu Arg Leu Tyr Val Gly Gly Pro Met His Asa Ser Arg Gly Asp 
35 40 45 

CTA TGC GGG TAC CGT AGA TGC CGC GCG AGC GGC GTA TAC ACC ACA AGC 12 0 

Leu Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser 
50 55 60 

TTC GGG AAC ACT CTG ACG TGC TTC CTC AAG GCC ACA GCG GCC ACC AAA 23 3 

Phe Gly Asn Thr Leu Thr Cys Phe Leu Lys Ala Thr Ala Ala Thr Lys 
65 70 75 

GCC GCT GGC CTA AAG GAC TGC ACC ATG TTG GTG TGT GGT GAC GAC TTA 286 
Ala Ala Gly Leu Lys Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu 
80 * 35 90 95 

GTC GTT ATC GCC GAA AGC GAT GGT GTC GAA GAG GAC CGC CGA GCC CTC 3 34 

Val Val He Ala Glu Ser Asp Gly Val Glu Glu Asp Arg Arg Ala Leu 
100 105 HO 



GGA GCT 
Gly Ala 



340 



(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 

Ser Thr Val Thr Glu Arg As? He Arg Val Glu Glu Glu Val Tyr Gin 
1 5 10 15 
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rv <- Asp Leu Glu Pro Glu Thr Arg Lys Val He Ser Ala Leu Thr 
20 2S 30 

Glu Arg Leu Tyr Val Gly Gly Pro Met His Asn Ser Arg Gly Asp Leu 
35 40 45 

rg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser ?he 



Cys Gly Tyr Arg Arg 

50 55 



60 



Gly Asn Thr Leu Thr Cys Phe Leu Lys Ala Thr Ala Ala Thr Lys Ala 
65 ™ 75 30 

Ala Gly Leu Lys Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val 
85 9° 95 

Va^ Tie Ala Glu Ser As? Gly Val Glu Glu Asp Arg Arg Ala Leu Gly 
100 "5 HO 

f"i Ala 

53 (2) INFORMATION FOR SEQ ID NO: 205: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 0 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 
(3) LOCATION: 2 . .34 0 

(ix) FEATURE: 

(A) NAME/KEY: mat_i>eptide 
(3) LOCATION: 2. .337 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

C TCC ACQ GTG ACC GAA AGG GAT ATC AGG ACC GAG GAA GAG ATC TAC 
Ser Thr Vai Thr Glu Arg Asp lie Arg Thr Glu Glu Glu He Tyr 
! 5 10 15 

CAG TGC TGC GAC CTG GAG CCC GAA GCC CGC AAG GTG ATA TCC GCC CTA 
Sn C/s Asp Leu Glu Pro Glu Ala Arg Lys Val lie Ser Ala Leu 

20 25 

ACG GAA AGA CTC TAC GTG GGC GGT CCC ATG TAC AAC TCC AAG GGG GAC 
Thr Glu Arg Leu Tyr Val Gly Gly Pro Met Tyr Asn Ser Lys Gly Asp 



46 



94 



142 



35 



40 
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CTA TGC GGG CAA CGG AGG TGC CGC GCA AGC GGG G7C TAC ACC ACC AGC 
Leu Cys Gly Gin Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser 
50 55 60 



190 



TTC GGG AAC ACT GTA ACG TGT TAT CTC AAG GCC GTT GCG GCT ACT AGG 
Phe Gly Asn Thr Val Thr Cys Tyr Leu Lys Ala Val Ala Ala Thr Arg 
65 70 75 



238 



GCC GCA GGT CTG AAA GGT TGC AGC ATG CTG GTT TGT GGA.GAC GAC TTA 
Ala Ala Gly Leu Lys Gly Cys Ser Met Leu Val Cys Gly Asp Asp Leu 
80 85 90 95 



236 



GTC GTC ATC TGC GAG AGC GGC GGC GTA GAG GAG GAT GCA AGA GCC CTC 
Val Val He Cys Glu Ser Gly Gly Val Glu Glu Asp Ala Arg Ala Leu 
100 105 110 



334 



CGA GCC 



340 



(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

Ser Thr Val Thr Glu Arg Asp He Arg Thr Glu Glu Glu lie Tyr Gin 
15 10 15 

Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val He Ser Ala Leu Thr 
20 25 30 

Glu Arg Leu Tyr Val Gly Gly Pro Met Tyr Asn Ser Lys Gly Asp Leu 
35 40 45 

Cys Gly Gin Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 
50 55 60 

Gly Asn Thr Val Thr Cys Tyr Leu Lys Ala Val Ala Ala Thr Arg Ala 
65 70 75 80 

Ala Gly Leu Lys Gly Cys Ser Met Leu Val Cys Gly Asp Asp Leu Val 
85 90 95 

Val He Cys Glu Ser Gly Gly Val Glu Glu As? Ala Arg Ala Leu Arg 
100 105 HO 

Ala 

(2) INFORMATION FOR SEQ ID NO: 207: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 340 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 
{D} TOPOLOGY: linear 

<ii} MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



Q 

!°4 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 . . 340 

(ix) FEATURE: 

(A) NAME /KEY : mac ^peptide 

(B) LOCATION: 2 . . 337 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20' 



C TCC ACG GTG ACT GAA AGG GAC ATT AGG GTC GAG GAA GAG ATC TAC 
Se- Thr Val Thr Glu Arg Asp He Arg Val Glu Glu Glu He Tyr 
\U * 5 10 15 

m 1 

CAG TGC TGT GAC CTG GAG CCC GAG GCA CGC AAG GTG ATA TCC GCT CTC 
Gln cys Cys Aso Leu Glu Pro Glu Ala Arg Lys Val He Ser Ala Leu 
O 20 25 30 



ACA GAA AGA CTC TAC AAG GGC GGC CCC ATG TAT AAC AGC AAG GGG GAC 
Th- Glu Arc Leu Tyr Lvs Gly Gly Pro Met Tyr Asn Ser Lys Gly As? 
35 40 45 

CTA TGC GGG CTT CGG AGG TGC CGC GCA AGC GGG GTA TAC ACC ACA AGC 
Leu Cys Gly Leu Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser 
50 55 60 

TTr GGG AAC ACG GTG ACA TGC TAC CTT AAA GCC ACA GCA GCC ACC AGG 
Phe Gly Asn Thr Val Thr Cys Tyr Leu Lys Ala Thr Ala Ala Thr Arg 
6S 70 75 

GCT GCA GGG CTG AAA GAT TGC ACT ATG CTG GTA TGC GGT GAC GAC TTA 
Ala Ala Gly Leu Lys Asp Cys Thr Me. Leu Val Cys Gly Asp Asp Leu 
80 85 90 9= 

GTC GTT ATT GCC GAA AGC GGT GGC GTG GAG GAG GAC GCC CGA GCC CTC 
Vai val He Ala Glu Ser Gly Gly Val Glu Glu Asp Ala Arg Ala Leu 
100 105 11° 

CGA GCC 
Arg Ala 



46 



94 



142 



190 



238 



286 



334 



340 



(2) INFORMATION FOR SEQ ID NO: 208: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: pro Cain 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 
3er Thr Val Thr Glu Arg Asp He Arg Val Glu Glu Glu He Tyr Gin 
15 10 15 

Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val He Ser Ala Leu Thr 
20 25 30 

Glu Arg Leu Tyr Lys Gly Gly Pro Met: Tyr Asn Ser Lys Gly Asp Leu 
35 40 45 

Cys Gly Leu Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 
50 55 60 

^ Gly Asn Thr Val Thr Cys Tyr Leu Lys Ala Thr Ala Ala Thr Arg Ala 

5f 65 70 75 SC 

WW 

Nl Ala Gly Leu Lys Asp Cys Thr Men Leu Val Cys Gly Asp Asp Leu Val 
UJ 8 5 90 9 5 

fU 

fU Val He Ala Glu Ser Gly Gly Val Glu Glu Asp Ala Arg Ala Leu Arg 
a p 100 105 110 



Ala 



(2) INFORMATION FOR SEQ ID NO: 2 05: 



iJ (i) SEQUENCE CHARACTERISTICS: 

H= (A) LENGTH: 34 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANT I- SENSE: NO 



(ix) FEATURE : 

(A) NAME/KEY: CDS 
(3) LOCATION: 1. .340 



(xi) SEQUENCE DESCRIPTION: *SEQ ID NO: 209: 
CCCCACCGTG ACNGAGAGGG ACNTCAGGGT CGAGGAAGAG GTCTATCAGT GCTGTAATCT 6 0 

GGAGNCCGAT GNCCGCAAGG TCATCAACGC CCTCACAGAG AGACTCTACG TGGGCGGCCC 12 0 
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TATGCACAAC AGCAAGGGAG ACCTGTGTGG CATCCGTAGA TGCCGCGCGA GCGGCGTTTA 180 
CACCACGAGC TTCGGAAACA CGCTGACTTG CTACCTCAAA GCCACAGCGG CCACCAGGGC 240 
CGCGGGCTTG AAGGATTGCA CCATGCTGGT CTGCGGNGAC GACCTGGTTG TCATTGCTGA 3 00 

GAGCATTGGC ATAGACGAGG ACAAGCAAGC CCTCCGStACT 
(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NC : 210: 

Pro Th~ Val Thr Glu Arg As? Xaa Arc Val Giu Glu Giu Vai Tyr Gin 
1 5 10 15 

Cys Cys Asn Leu Glu Xaa Asp Xaa Arg Lys Val lie Asn Ala Leu Thr 
20 25 30 

Glu Arg Leu Tyr Val Gly Gly Pro Mec His Asn Ser Lys Gly Asp Leu 
35 40 45 

Cys Gly He Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 
SO 55 60 

Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Thr Ala Ala Thr Arg Ala 
65 ™ 75 

Ala Gly Leu Lys Asp Cys Thr Met Leu Vai Cys Gly As? As? Leu Vai 
85 9° 9= 

Vai He Ala Glu Ser lie Gly He Asp Glu Asp Lys Gin Ala Leu Arg 
100 105 110 

Thr 



(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/Z2Y: CDS 

(B) LOCATION : I. .340 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

CTCGACTGTG NCCGAGAGGG ACATCAGGAC AGAGGGAGAG GTCTATCAGT GTTGCGACCT 60 

GGAACCGGAA GCCCGCAAGG TAATCACCGC CCTCACTGAG AGACTCTATG TGGGCGGACC 12 0 
CATGTTCAAC AGCAAGGGAG ACCTGTGCGG ACAACGCCGG TGCCGCGCAA GCGGCGTGTT - 18 0 

CACCACCAGC TTCGGGAACA CACTGACGTG CTACCTTAAA GCCACAGCTG CTACT AG AG C 24 0 

AGCCGGCTTA AAAGATTGCA CCATGCTGGT CTGCGGTGAC GACTTAGTCG TTATTTCCGA 3 00 

GAGCGCCGGT GTGGAGGAGG ATCCCANAAC CCNNCGACCN 34 0 



HJ (2) INFORMATION FOR SEQ ID NO: 212: 

nt 

,p (i) SEQUENCE CHARACTEP.ISTICS : 

e (A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNE3S : single 
CD) TOPOLOGY: linear 



(ii) MOLECULE TYPE : cDNJ 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 

Ser Thr Vai Xaa Glu Arg Asp lie Arg Thr Giu Gly Glu Val Tyr Gin 
1 5 10 15 

Cys Cys Asp Leu Giu Pro Glu Ala Arg Lys Val He Thr Ala Leu Thr 
20 25 30 

Glu Arg Leu Tyr Val Gly Gly Pro Met Phe Asn Ser Lys Gly Asp Leu 
35 40 45 

Cys Gly Gin Arg Arg Cys Arg Ala Ser Gly Val Phe Thr Thr Ser Phe 
50 55 °° 

Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Thr Ala Ala Thr Arg Ala 
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65 70 75 80 

Ala Gly Leu Lys Asp Cys Thr Mec Leu Val Cys Gly Asp Asp Leu Val 
85 90 95 

Val lie Ser Giu Ser Ala Gly Val Glu Glu Asp Pro Xaa Thr Xaa Arg 
100 105 110 

Pro 



(2) INFORMATION FOR 3EQ ID MO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



\£i (iii) HYPOTHETICAL: NO 

ID 

Sj (iii) ANT I- SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 2 . .340 

(ix) FEATURE: 

(A) NAME / KEY : mat_pepmde 
(3) LOCATION: 2. .3 37 



in 



( X i) SEQUENCE DESCRIPTION : SEQ ID NO: 213: 

C TCA ACA GTC ACC GAG AAC GAC ATC CGT GTT GAG GAG TCA ATT TAC 
Ser Thr Val Thr Glu Asn Asp lie Arg Val Glu Glu Ser lie Tyr 
is 10 15 



GOT GCG AAG CTC CAG GAC TGC ACG ATG CTC GTG TGC GGG GAC GAC CTT 



46 



CAA TGT TGT GAC TTG GCC CCC GAG GCC AGA CAG GCC ATA AAG TCG CTC 94 
Gin Cys Cvs Aso Leu Ala Pro Glu Ala Arg Gin Ala He Lys Ser Leu 
20 25 30 

ACA GAG CGG CTT TAT ATC GGG GGT CCC CTG ACT AAT TCA AAG GGG CAG 14 2 

Thr Glu Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser Lys Gly Gin 
35 40 45 

AAC TGT GGC TAT CGC CGA TGC CGC GCA AGC GGC GTG CTG ACG ACC AGC 190 
Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr Ser 
50 55 60 

TGC GGT AAT ACC CTT ACA TGT TAC"CTA AAG GCC TCT GCA GCC TGT CGA 23 8 

Cvs Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Cys Arg 
65 7C 75 



286 



SUBSTITUTE SHEET (RULE 26) 



WO 94/254 



268 



PCT/EP94/01323 



Ala Ala Ly3 Leu Gin Asp Cys Thr Met: Leu Val Cys Gly Asp Asp Leu 
80 85 90 95 



GTC GTT ATC TGT GAA AGC GCG GGA ACC CAA GAG GAC GCG GCG AGC CTA 
Val Val lie Cys Glu Ser Ala Gly Thr Gin Glu Asp Ala Ala Ser Leu 
100 105 no 



334 



CGA GTC 
Arg Val 



340 



(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

™ (ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

\I Ser Thr Val Thr Glu Asn Asp lie Arg Val Glu Glu Ser lie Tyr Gin 

Wl5 10 15 

Hi 

f\J Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He Lys Ser Leu Thr 

42 20 25 30 

«s* Glu Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser Lys Gly Gin Asn 

5 3 5 4 0 4 5 



U 



Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Cys 
50 55 SO 

Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Cys Arg Ala 
65 70 75 80 

Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val 
85 90 95 

Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Ala Ala Ser Leu Arg 
100 105 HO 

Val 



(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 
(E) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
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{iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2. .340 

(ix) FEATURE: 

(A) NAME/ KEY : mat_pe?txde 

(B) LOCATION: 2 . . 340 




PCT/EP94/01323 

269 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

C TCA ACC GTC ACG GAG AGG GAT ATA AGA ACA GAA GAA TCC ATA TAT 4 6 

Ser Thr Val Thr GIu Arg Asp lie Arg Thr Glu Glu Ser He Tyr 

ESS? 

m CAA GCT TGT TCC CTG CCC CAA GAG GCC AGA ACT GTC ATA CAC TCG CTC 94 

£0 Gin Ala Cys Ser Leu Pro Gin Glu Ala Arg Thr Val tie His Ser Leu 

%j 20 25 30 



i U 



AGA GCT 
Arg Ala 



ACC GAG AGA CTC TAC GTG GGA GGG CCC ATG ATA AAC AGC AAA GGG CAA 14 2 

Thr Glu Arg Leu Tyr Val Gly Giy Pro Mec He Asn Ser Lys Gly Glr. 
35 40 45 

TCC TGC GGT TAC AGG CGT TGC CGC GGA AGC GGT GTT TTC ACC ACC AGC 190 
Ser Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Phe Thr Thr Ser 
50 55 60 

ATG GGG AAT ACC ATG ACG TGT TAC ATC AAA GCC CTT GCA GCG TGT AAA 23 8 

Met Gly Asn Thr Met Thr Cys Tyr He Lys Ala Leu Ala Ala Cys Lys 
65 70 75 

GCC GCA GGG ATC GTG GAC CCC GTC ATG CTG GTG TGT GGA GAC GAC CTG 28 6 

Ala Ala Glv He Val Aso Pro Val Met Leu Val Cys Gly Asp Asp Leu 
80 * 85 90 95 

GTC GTC ATC TCG GAG AGC CAG GGT AAC GAG GAG GAC GAG CGA AAC CTG 3 34 

Val Val lie Ser Glu Ser Gin Gly Asn Glu Glu Asp Glu Arg Asn Leu 
100 105 HO 



340 



(2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

Ser Thr Val Thr Glu Arg Asp lie Arg Thr Glu Glu Ser lie Tyr Gin 
1 5 10 15 

Ala Cys Ser Leu Pro Gin Glu Ala Arg Thr Val lie His Ser Leu Thr 
20 25 30 

Glu Arg Leu Tyr Val Gly Gly Pro Met lie Asn Ser Lys Gly Gin Ser 
35 40 45 

Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Phe Thr Thr Ser Met 
50 55 60 

Gly Asn Thr Met Thr Cys Tyr lie Lys Ala Leu Ala Ala Cys Lys Ala 
65 70 75 80 

Ala Gly lie Val Asp Pro Val Met Leu Val Cys Gly Asp Asp Leu Val 
95 90 95 

Val lie Ser Glu Ser Gin Gly Asn Glu Glu Asp Glu Arg Asn Leu Arg 
100 105 110 

Ala 



(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 0 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
(3) LOCATION: 2 . . 340 

(ix) FEATURE: 

(A) NAME / KEY : mat_peptide 

(B) LOCATION: 2. .340 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

C TCG ACT GTC ACT GAA CAG GAC ATC AGG GTG GAA GAG GAG ATA TAT 4o 
Ser Thr Val Thr Glu Gin Asp lie Arg Val Glu Glu Glu He Tyr 
1 5 10 15 

CAA TGC TGC AAC CTT GAA CCG GAG GCC AGG AAA GTG ATC TCC TCC CTC 94 
Gin Cys Cys Asn Leu Glu Pro Glu Ala Arg Lys Val lie Ser Ser Leu 
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20 



25 



30 



ACG GAG CGG CTT TAC TGC GGA GGC CCT ATG 
Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met 
35 40 



TTT AAC AGC AAG GGG GCC 
Phe Asn Ser Lys Gly Ala 
45 



142 



CAG TGT GGT TAT CGC CGT TGC CGT 
Gin Cys Gly Tyr Arg Arg Cys Arg 
50 55 



gc: 



AGT GGA GTT CTG CCT ACC AGC 
Ser Gly Val Leu Pro Tiir Ser 
60 



190 



TTT GGC AAC ACA ATC ACT TGT 
Phe Gly Asn Thr lie Thr Cys 
65 70 



TAC ATC 
Tyr lie 



AAG GCC ACA ACG GCC GCG AAG 
Lys Ala Thr Thr Ala Ala Lys 
75 



238 



GCC GCA GGC CTC CGG AAC CCG GAC 
Ala Ala Gly Leu Arg Asn Pro Asp 

80 85 
GTC GTG GTG GCT GAG AGT GAT GGC 
Val Val Val Ala Glu Ser Asp Gly 
100 

AGA GCC 
Arg Ala 



Phe 

GTC 
Val 



CTT GTC TGC GGA GAT GAT CTG 
Leu Val Cys Gly Asp Asp Leu 
90 95 
GAC GAG GAT AGA GCA GCC CTG 
Asp Glu Asp Arg Ala Ala Leu 
105 110 



286 



334 



340 



(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

Ser Thr Val Thr Glu Gin Asp He Arg Val Glu Glu Glu He Tyr Gin 
1 5 " 10 15 

Cys Cys Asn Leu Glu Pro Glu Ala Arg Lys Val He Ser Ser Leu Thr 
20 25 30 

Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly Ala Gin 
35 40 45 

Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr Ser Phe 
50 55 60 

Gly Asn Thr He Thr Cys Tyr He Lys Ala Thr Thr Ala Ala Lys Ala 
65 70 75 80 

Ala Gly Leu Arg Asn Pro Asp Phe Leu Val Cys Gly Asp Asp Leu Val 
85 90 95 

Val Val Ala Glu Ser Asp Gly Val Asp Glu Asp Arg Ala Ala Leu Arg 
100 105 HO 
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Ala 



. 

IS 
\j 

: ,. i 

TU 
m 

i TM? 

I 

in 



(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

( D ) TO POLOGY : 1 inear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

Arg Ser Glu Gly Arg Thr Ser Trp Ala Gin 
15 10 

(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 ammo acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 220: 

Arg Ser Glu Gly Arg Thr Ser Trp Ala Gin 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

Arg Thr Glu Gly Arg Thr Ser Trp Ala Gin 
15 10 



(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 9 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



FEATURE: 

(A) NAME / KEY : 
<B) LOCATION: 

FEATURE: 

(A) NAME/XEY: 

(B) LOCATION: 



CDS 

3 . .629 



mat_pe?tide 
3. .629 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 

TA GAC TTT TGG GAG AGC GTC TTC ACT GGA CTA ACT CAC ATA GAT GCC 
Asp Phe Tro Glu Ser Val Phe Thr Gly Leu Thr His lie As? Ala 
i ' 5 10 15 

CAC TTT CTG TCA CAG ACT AAG CAG CAG GGA CTC AAC TTC TCG TTC CTG 
His Phe Leu Ser Gin Thr Lys Gin Gin Gly Leu Asn Phe Ser Phe Leu 
20 25 30 

ACT GCC TAC CAA GCC ACT GTG TGC GCT CGC GCG CAG GCT CCT CCC CCA 
Thr Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro Pre Pro 
35 40 45 

AGT TGG GAC GAG ATG TGG AAG TGT CTC GTA CGG CTT AAG CCA ACA CTA 
Ser Trp Asp Glu Met Trp Lys Cys Leu Val Arg Leu Lys Pro Thr Leu 
50 55 60 

CAT GGA CCT ACG CCT CTT CTA TAT CGG TTG GGG CCT GTC CAA AAT GAA 239 
His Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro Val Gin Asn Glu 
65 ™ 75 



ATC TGC TTG ACA CAC CCC ATC ACA AAA TAC ATC ATG GCA TGC ATG TCA 
lie Cys Leu Thr His Pro He Thr Lys Tyr lie Met Ala Cys Met Ser 

85 90 9:3 



80 



GCT GAT CTG GAA GTA ACC ACC AGC ACC TGG GTT TTG CTT GGA GGG G.C 
Ala Aso Leu Glu Val Thr Thr Ser Thr Trp Val Leu Leu Gly Gly Va. 

100 105 110 

CTC GCG GCC CTA GCG GCC TAC TGC TTG TCA GTC GGT TGT GTT GTG ATT 
Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly Cys Val Val He 
115 120 125 

GTG GGT CAT ATC GAG CTG GGG GGC AAG CCG GCA ATC GTT CCA GAC AAA 
Val Gly His He Glu Leu Gly Gly Lys Pro Ala He Val Pro As? Lys 
130 135 140 



47 



95 



143 



191 



287 



335 



333 



431 
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GAG GTG TTG TAT CAA CAA TAC GAT GAG ATG GAA GAG TGC TCA CAA GCT 
Glu Val Leu Tyr Gla Gin Tyr Asp Glu Met Glu Glu Cys Ser Gin Ala 
145 150 155 



479 



GCC CCA TAT ATC GAA CAA GCT CAG GTA ATA GCT CAC CAG TTC AAG GAA 
Ala Pro Tyr He Glu Gin Ala Gin Val He Ala His Gin Phe Lys Glu 
160 165 170 175 



527 



AAA GTC CTT GGA TTG CTG CAG CGA GCC ACC CAA CAA CAA GCT GTC ATT 
Lys Val Leu Gly Leu Leu Gin Arg Ala Thr Gin Gin Gin Ala Val He 
130 135 190 



575 



GAG CCC ATA GTA ACT ACC AAC TGG CAA AAG CTT GAG GCC TTT TGG CAC 
Glu Pro lie Val Thr Thr Asn Trp Gin Lys Leu Glu Ala Phe Trp His 

195 200 205 



623 



AAG CAT 
Lys His 



629 



(2) INFORMATION FOR SEQ ID NC : 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 9 amine acids 

(B) TYPE: amine acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 

Asp Phe Trp Glu Ser Val Phe Thr Gly Leu Thr His lie Asp Ala His 
1 5 10 15 

Phe Leu Ser Gin Thr Lys Gin Gin Gly Leu Asn Phe Ser Phe Leu Thr 
20 25 30 

Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro Pro Pro Ser 
35 40 45 

Trp Asp Glu Met Trp Lys Cys Leu Val Arg Leu Lys Pro Thr Leu His 
50 55 60 

Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro Val Gin Asn Glu He 
65 70 75 80 

Cys Leu Thr His Pro He Thr Lys Tyr He Met Ala Cys Met Ser Ala 
85 90 95 

Asp Leu Glu Val Thr Thr Ser Thr Trp Val Leu Leu Gly Gly Val Leu 
100 105 HO 

Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly Cys Val Val He Val 
115 120 125 



Gly His He Glu Leu Gly Gly Lys Pro Ala He Val Pro Asp Lys Glu 
130 135 140 
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Val Leu Tyr Gin Gin Tyr Asp Glu Met Glu Glu Cys Ser Gin Ala Ala 
145 150 155 160 

Pro Tyr He Glu Gin Ala Gin Val He Ala His Gin Phe Lys Glu Lys 
165 170 175 

Val Leu Gly Leu Leu Gin Arg Ala Thr Gin Gin Gin Ala Val He Glu 
180 185 190 

Pro He Val Thr Thr Asn Trp Gin Lys Leu Glu Ala Phe Trp His Lys 
195 2C0 205 

His 



(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE : peptide 



(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 2.. 12 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

lie His Tyr Arg Asn Ala Ser Gly lie Tyr His He 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 

Val Asn Tyr Arc Asn Ala Ser Gly He Tyr His He 
1 5 10 * 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 226: 

Val Asn Tyr Arg Asn Ala Ser Gly Val Tyr His lie 
15 10 

(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
(3) TYPE: amine acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 227: 

Val Asn Tyr His Asn Thr Ser Gly lie Tyr His Leu 
1 ' 5 10 

(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 

Gin His Tyr Arg Asn Ala Ser Gly He Tyr His Val 
1*5 10 

(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 9: 

Gin His Tyr Arg Asn Val Ser Gly He Tyr His Val 
1 5 10 

(2) INFORMATION FOR SEQ 12 NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 

He His Tyr Arg Asn Ala Ser Asp Gly Tyr Tyr He 
15 10 

(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

Leu Gin Val Lys Asn Thr Ser Ser Ser Tyr Met Val 
15 10 

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 

Val Trp Gin Leu Arg Ala He Val Leu His Val 
1 5 10 

(2) INFORMATION FOR SEQ ID NO : 23 3: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 

Val Tyr Glu Ala Asp Tyr His lie Leu His Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 234: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 

Val Tyr Glu Thr Asp Asn His lie Leu His Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO : 23 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

Val Tyr Glu Thr Glu Asn His lie Leu His Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
■(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

Val Phe Glu Thr Val His His lie Leu His Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

Val Phe Glu Thr Glu His His He Leu His Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 

Val Phe Glu Thr Asp His His He Met His Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: "SEQ ID NO: 239: 

Val Tyr Glu Thr Glu Asn His He Leu His Leu 
15 10 
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(2) INFORMATION FOR SEQ ID NO: 240: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

Val Tyr Glu Ala Asp Ala Leu lie Leu His Ala 
15 10 

(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

Val Gin Asp Gly Asn Thr Ser Ala Cys Trp Thr Pro Val 
15 10 

(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 
Val Lys Thr Gly Asn Gin Ser Arg Cys Trp Val Ala Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino -acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 

Val Lys Thr Gly Asn Gin Ser Arg Cys Trp Val Ala Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE : amine acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

Val Arg Thr Gly Asn Gin Ser Arg Cys Trp Val Ala Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE : amine acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 245: 

Val Lys Thr Gly Asn Gin Ser Arg Cys Trp He Ala Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 
Val Lys Thr Gly Asn Gin Ser Arg Cys Trp He Ala Leu 
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15 10 
(2) INFORMATION FOR SEQ ID NO: 24 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

Val Lys Thr Gly Asn Ser Val Arg Cys Trp lie Pro Leu 
1 5 10* 

»* (2) INFORMATION FOR SEQ ID NC : 24 8: 

^ (i) SEQUENCE CHARACTERISTICS: 

^ (A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
y (C) STRAND EDNESS : single 

nJ (D) TOPOLOGY: linear 

sss : 
£ S E 

£ (ii) MOLECULE TYPE : peptide 



m 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

Val Lys Thr Gly Asn Val Ser Arg Cys Trp lie Ser Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 24 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

Val Arg Lys Asp Asn Val Ser Arg Cys Trp Val Gin lie 
15 10 

(2) INFORMATION FOR SEQ ID NO: 25 0: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 25 C: 

Ala Pro Ser Phe Gly Ala Val Thr Ala Pro 
15 10 
(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

( C ) STRAND EDNES S : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

Val Ser Gin Pro Gly Ala Leu Thr Lys Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 52: 

Val Lys Tyr Val Gly Ala Thr Thr Ala Ser 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 
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Ala Pro Tyr He Gly Ala Pro Val Glu Ser 

(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



! i ; 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

Ala Gin His Leu Asn Ala Pro Leu Glu Ser 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 255: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



yi (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255 

M:' Ser Pro Tyr Val Gly Ala Pro Leu Glu Pro 

1 5 * 10 



(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

Ser Pro Tyr Ala Gly Ala Pro Leu Glu Pro 
1 5 io 

(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 257: 

Ala Pro Tyr Leu Gly Ala Pro Leu Glu Ser 
15 io 

(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 

Ala Pro Tyr Leu Gly Ala Pro Leu Glu Ser 
15 io 

(2) INFORMATION FOR SEQ ID NO: 25 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

Ala Pro Tyr Val Gly Ala Pro Leu Glu Ser 
15 10 

(2) INFORMATION FOR SEQ ID NO: 260: 

(i)' SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 
As n Val Pro Tyr Leu Gly Ala Pro Leu Thr Ser 



(2) INFORMATION FOR SEQ ID NO : 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

Ala Pro His Leu Arg Ala Pro Leu Ser Ser 
15 10 

(2) INFORMATION FOR SEQ ID NO: 2 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE : amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 262: 

Ala Pro Tyr Leu Gly Ala Pro Leu Thr Ser 
15 io 

(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) ST RAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

Arg Pro Arc Gin His Ala Thr Val Gin Asp 
1 5 lcT 

(2) INFORMATION FOR SEQ ID NO: 254: 



1 



5 



10 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264 

Ser Pro Gin His His Lys Phe Val Gin Asp 
15 xo 

(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265 

Arg Pro Arg Arg Leu Trp Thr Thr Gin Glu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266 
Pro Pro Arg He His Glu Thr Thr Gin Asp 



(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 14 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 



Thr He Ser Tyr Ala Asn Gly Ser Gly Pro Ser Asp Asp Lys 
15 10 



(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amine acid 

(C) STRANDEDNESS : single 
* (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 

Ser Arg Arg Gin Pro lie Pro Arg Ala Arg Arg Thr Glu Gly Arg Ser 
15 10 is" 

Trp Ala Gin 



(2) INFORMATION FOR SEQ ID NO: 26 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1443 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL : NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .1443 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1 . . 1443 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

ACC ATC ACC ACC GGA GCT TCT ATC ACA TAC TCC ACT TAC GGC AAG TTC 4 8 

Thr lie Thr Thr Gly Ala Ser lie Thr Tyr Ser Thr Tyr Gly Lys Phe 
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CTT GCT GAT GGA 
Leu Ala Asp Gly 
20 

GAC GAG TGC CAT 
Asp GIu Cys His 
35 

GTC CTT GAC CAG 
Val Leu Asp Gin 
50 

ACG GCC ACC CCT 
Thr Ala Thr Pro 
65 

GAA GTG GCC CTG 
Glu Val Ala Leu 



GGG 
Gly 



TGT TCA GGC 
Cys Ser Gly 



TCC 
Ser 



GCA 
Ala 



CCC 
Pro 



ATT CCC 
He Pro 



TCC AAG 
Ser Lys 



GTG AAC 
Val Asn 
130 

ACA GCA 
Thr Ala 
145 

TTC ACC 
Phe Thr 



CTT GCT 
Leu Ala 

100 

AAA AAA 
Lys Lys 
115 

GCC GTG 
Ala Val 



CCT 
Pro 
85 

TTT 
Phe 



CAG GAC GCC 
Gin Asp Ala 
40 

GAG ACG GCT 
Glu Thr Ala 
55 

GGC AGT GTG 
Gly Ser Val 
70 

CAG GAG GGG 
Gin Glu Gly 



GGC 
Gly 
25 

ACC 
Thr 



GGA 

Gly 



10 

GCG TAT 
Ala Tyr 



15 



ACC ATT 

Thr He 



GCT AGG 
Ala Arg 



ACA 
Thr 



GAG 
Glu 



ATA AAG GGT 
He Lys Gly 



GGA GAC 
Gly Asp 



TGT 
Cys 



GCA 
Ala 



GTG 
Val 



GAT GAA CTC 
Asp Glu Leu 
120 

TAT TAT AGA 
Tyr Tyr Arg 
135 

GTC GTG TGC 
Val Val Cys 
150 



GGT 
Gly 
105 

GCC 
Ala 



GGC GAC 
Gly Asp 



CAG ACG 
Gin Thr 



ACA GTG 
Thr Val 



GGG AGA 
Gly Arg 
210 

CCG TCT 
Pro Ser 
225 



GTG GAC 
Val Asp 
180 

CCC CAG 
Pro Gin 
195 

GGT AGG 
Gly Arg 



GAC ATG 
Asp Met 



Phe 
165 

TTC 
Phe 



GAC 
Asd 



CAC 
Kis 



TTC 
Phe 



GAT TCT GTC 
Asp Ser Val 



AGT CTG GAT 
Ser Leu Asp 



GCA GTG TCC 
Ala Val Ser 
200 

GGC ATA TAC 
Gly He Tyr 
215 

GAC TCC GTG 
Asp Ser Val 
230 



GGT 
Gly 



AGC 
Ser 



ATA 
lie 



ACG CCC 
Thr Pro 
75 

GTT CCC 
Val Pro 
90 

AGG CAT 
Arg His 



AAG CAA 
Lys Gin 



CTA GAC 
Leu Asp 



ACC GAC 
Thr Asp 
155 

GAC TGC 
Asp Cys 
170 



GAC GTG ATC ATA TGC 
Asp Val He He Cys 

30 

CTT GGG ATA GGC ACT 
Leu Gly He Gly Thr 
45 

CTC GTC GTC TTG GCC 
Leu Val Val Leu Ala 
60 

CAC CCC AAC ATC GAG 
His Pro Asn He Glu 
80 

TTC TAC GGC AGA GCC 
Phe Tyr Gly Arg Ala 
95 

CTC ATC TTC TGC CAT 
Leu He Phe Cys His 

110 

CTG ACC AGC CTG GGC 
Leu Thr Ser Leu Gly 
125 

GTC GCC GTC ATC CCC 
Val Ala Val He Pro 
140 

GCG CTC ATG ACG GGA 
Ala Leu Met Thr Gly 
160 

AAC TCC GCC GTC ACT 
Asn Ser Ala Val Thr 
175 



CCC 
Pro 
185 

AGA 
Arg 



CGG 
Arg 



GTG 
Val 



ACT TTT ACC ATT GAG ACT ACC 
Thr Phe Thr He Glu Thr Thr 
190 

AGC CAG CGT AGG GGC CGC ACG 
Ser Gin Arg Arg Gly Arg Thr 
205 

TAT GTC TCG GCT GGA GAG AGA 
Tyr Val Ser Ala Gly Glu Arg 
220 

CTC TGT GAG TGC TAC GAT GCC 
Leu Cys Glu Cys Tyr Asp Ala 
235 * 240 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 
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GGA TGT GCG 
Gly Cys Ala 



CGC GCT TAC 
Arg Ala Tyr 



GAA TTC TGG 
Glu Phe Trp 
275 



TGG TAT 
Trp Tyr 
245 

ATA AAC 
lie Asn 
260 

GAG GGG 
Glu Gly 



GAT 
Asp 



ACC 
Thr 



GTG 
Val 



CTG ACT 
Leu Thr 



CCC GGG 
Pro Gly 



TTC ACG 
Phe Thr 
260 



290 

CCT GCC 
Pro Ala 
250 

CTC CCT 
Leu Pro 
265 

GGG CTC 
Gly Leu 
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GAG ACT ACC GTG AGG TTG 76 3 

Glu Thr Thr Val Arg Leu 
255 

GTC TGT CAG GAC CAT TTG 816 
Val Cys Gin Asp His Leu 
270 

ACT AAC ATC GAC GCT CAC 864 
Thr Asn He Asp Ala His 
285 



ATG CTG TCA 
Met Leu Ser 
290 



CAG ACC 
Gin Thr 



AAA 
Lys 



CAG GGT 
Gin Gly 
295 



GGG GAG 
Gly Glu 



AAT TTC 
Asn Phe 
300 



CCA TAC CTT GTA 
Pro Tyr Leu Val 



912 



GCG TAC CAA 
Ala Tyr Gin 
305 

TGG GAC ACA 
Trp Asp Thr 



GGC CCT ACT 
Gly Pro Thr 



GCA ACA GTC TGT GTT CGC GCG AAA GCG CCC CCC CCC AGC 960 

Ala Thr Val Cys Val Arg Ala Lys Ala Pro Pro Pro Ser 
310 315 320 

ATG TGG AAA TGC ATG CTC CGT CTC AAA CCG ACT TTA ACT 100 8 

Met Trp Lys Cys Met Leu Arg Leu Lys Pro Thr Leu Thr 

325 23C * 335 

CCC CTC TTG TAC AGG CTG GGG CCC GTC CAG AAT GAG ATC 10 5 5 

Pro Leu Leu Tyr Arg Leu Gly Pro Val Gin Asn Glu He 

340 345 350 



ACA CTG ACG CAC CCC ATC ACC AAG TAC ATT ATG GCT TGC ATG TCT GCG 
Thr Leu Thr His Pro He Thr Lys Tyr lie Met Ala Cys Met Ser Ala 
355 360 365 



1104 



GAC TTG GAG 
Asp Leu Glu 
370 



GTC ATT 
Val He 



ACC 
Thr 



AGC ACT 
Ser Thr 
375 



TGG GTT 
Trp Val 



CTG GTG 
Leu Val 
380 



GGG GGC GTT GTG 
Gly Gly Val Val 



1152 



GCG GCC CTG 
Ala Ala Leu 
385 

GGT AGG ATC 
Gly Arg He 



GCA TTA TAC 
Ala Leu Tyr 



GCG GCC 
Ala Ala 



ATC CTC 
He Leu 
405 

CAG CAA 
Gin Gin 
420 



TAC 
Tyr 
390 

TCT 
Ser 



TTT 
Phe 



TGC TTG 
Cys Leu 



GGG AAA 
Gly Lys 



GAT GAG 
Asp Glu 



ACG GTG 
Thr Val 



CCT GCC 
Pro Ala 
410 

ATG GAG 
Met Glu 
425 



GGT TCG 
Gly Ser 
395 

ATC ATT 
He He 



GAG TGC 
Glu Cvs 



GTA GCC 
Val Ala 



CCC GAT 
Pro Asp 



TCG GCC 
Ser Ala 
430 



ATA GTC 
He Val 
400 

AGG GAG 
Arg Glu 
415 

TCG TTG 
Ser Leu 



1200 



1248 



1296 



CCC TAT ATG 
Pro Tyr Met 
435 



GAC GAG 
Asp Glu 



ACA 
Thr 



CGT GCC 
Arg Ala 
440 



ATT GCC 
He Ala 



GGA CAA 
Gly Gin 



TTC AAA 
Phe Lys 
445 



GAG AAA 
Glu Lys 



1344 



GTG CTC GGC 
Val Leu Gly 
450 



TTC ATC 
Phe He 



AGC 
Ser 



ACG ACC 
Thr Thr 
455 



GGC CAG 

Glv Gin 



AAG GCT 
Lys Ala 
4S0 



GAA ACT 
Glu Thr 



CTG AAG 
Leu Lys 



1392 



CCG GCA GCC ACG TCT GTG TGG AAC AAG GCT GAG CAG TTC TGG GCC ACA 
Pro Ala Ala Thr Ser Val Trp Asn Lys Ala Glu Gin Phe Trp Ala Thr 
465 470 475 480 



1440 
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(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 481 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

Thr lie Thr Thr Gly Ala Ser lie Thr Tyr Ser Thr Tyr Gly Lys Phe 
1 5 10 15 

Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp Val lie lie Cys 
20 25 30 

Asp Glu Cys His Ser Gin Asp Ala Thr Thr He Leu Gly lie Gly Thr 
35 40 45 

Val Leu Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val Val Leu Ala 
50 55 60 

Thr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asn lie Glu 

65 70 75 80 

Glu Val Ala Leu Pro Gin Glu Gly Glu Val Pro Phe Tyr Gly Arg Ala 

85 90 95 

lie Pro Leu Ala Phe lie Lys Gly Gly Arg His Leu lie Phe Cys His 
100 105 no 

Ser Lys Lys Lys Cys Asp Glu Leu Ala Lys Gin Leu Thr Ser Leu Gly 
115 120 125 

Val Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ala Val He Pro 
130 135 140 

Thr Ala Gly Asp Val Val Val Cys Ser Thr Asp Ala Leu Met Thr Gly 
145 150 155 160 

Phe Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn Ser Ala Val Thr 
165 170 175 

Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He Glu Thr Thr 
180 185 190 

Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg Gly Arg Thr 
195 200 205 

Gly Arg Gly Arg His Gly He Tyr Arg Tyr Val Ser Ala Gly Glu Arg 
210 215 220 

Pro Ser Asp Met Phe Asp Ser Val Val Leu Cys Glu Cys Tyr Asp Ala 



1443 
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240 



Gly Cys Ala Trp Tyr Asp Leu Thr Pro Ala Glu Thr Thr Val Arg Leu 
245 250 255 

Arg Ala Tyr lie Asn Thr Pro Gly Leu Pro Val Cys Gin Asp His Leu 
260 265 270 

Glu Phe Trp Glu Gly Val Phe Thr Gly Leu Thr Asn lie Asp Ala His 
275 280 285 

Met Leu Ser Gin Thr Lys Gin Gly Gly Glu Asn Phe Pro Tyr Leu Val 
250 295 300 

Ala Tyr Gin Ala Thr Val Cys Val Arg Ala Lys Ala Pro Pro Pro Se- 
305 310 315 320 

Trp Asp Thr Met Trp Lys Cys Met Leu Arg Leu Lys Pro Thr Leu Thr 
325 330 335 

Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro Val Gin Asn Glu He 
340 345 350 

Thr Leu Thr His Pro He Thr Lys Tyr He Met Ala Cys Met Ser Ala 
355 360 365 

Asp Leu Glu Val He Thr Ser Thr Trp Val Leu Val Gly Gly Val Val 
370 375 380 

Ala Ala Leu Ala Ala Tyr Cys Leu Thr Val Gly Ser Val Ala He Val 
385 390 395 400 

Gly Arg He He Leu Ser Gly Lys Pro Ala He lie Pro Asp Arg Glu 
405 410 4i5 

Ala Leu Tyr Gin Gin Phe Asp Glu Met Glu Glu Cys Ser Ala Ser Leu 
420 425 430 

Pro Tyr Met Asp Glu Thr Arg Ala lie Ala Gly Gin Phe Lys Glu Lys 
435 440 445 

Val Leu Gly Phe He Ser Thr Thr Gly Gin Lys Ala Glu Thr Leu Lys 
450 455 460 

Pro Ala Ala Thr Ser Val Trp Asn Lys Ala Glu Gin Phe Trp Ala Thr 
465 470 475 480 



Tyr 
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